Dataset statistics
| Number of variables | 41 |
|---|---|
| Number of observations | 59400 |
| Missing cells | 46094 |
| Missing cells (%) | 1.9% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 118.9 MiB |
| Average record size in memory | 2.1 KiB |
Variable types
| CAT | 29 |
|---|---|
| NUM | 10 |
| BOOL | 2 |
Reproduction
| Analysis started | 2020-07-05 21:49:42.894963 |
|---|---|
| Analysis finished | 2020-07-05 21:50:33.163036 |
| Duration | 50.27 seconds |
| Version | pandas-profiling v2.8.0 |
| Command line | pandas_profiling --config_file config.yaml [YOUR_FILE.csv] |
| Download configuration | config.yaml |
recorded_by has constant value "GeoData Consultants Ltd" | Constant |
date_recorded has a high cardinality: 356 distinct values | High cardinality |
funder has a high cardinality: 1897 distinct values | High cardinality |
installer has a high cardinality: 2145 distinct values | High cardinality |
wpt_name has a high cardinality: 37400 distinct values | High cardinality |
subvillage has a high cardinality: 19287 distinct values | High cardinality |
lga has a high cardinality: 125 distinct values | High cardinality |
ward has a high cardinality: 2092 distinct values | High cardinality |
scheme_name has a high cardinality: 2696 distinct values | High cardinality |
extraction_type_group is highly correlated with extraction_type and 1 other fields | High correlation |
extraction_type is highly correlated with extraction_type_group and 1 other fields | High correlation |
extraction_type_class is highly correlated with extraction_type and 1 other fields | High correlation |
management_group is highly correlated with management | High correlation |
management is highly correlated with management_group | High correlation |
payment_type is highly correlated with payment | High correlation |
payment is highly correlated with payment_type | High correlation |
quality_group is highly correlated with water_quality | High correlation |
water_quality is highly correlated with quality_group | High correlation |
quantity_group is highly correlated with quantity | High correlation |
quantity is highly correlated with quantity_group | High correlation |
source_type is highly correlated with source and 1 other fields | High correlation |
source is highly correlated with source_type and 1 other fields | High correlation |
source_class is highly correlated with source and 1 other fields | High correlation |
waterpoint_type_group is highly correlated with waterpoint_type | High correlation |
waterpoint_type is highly correlated with waterpoint_type_group | High correlation |
funder has 3635 (6.1%) missing values | Missing |
installer has 3655 (6.2%) missing values | Missing |
public_meeting has 3334 (5.6%) missing values | Missing |
scheme_management has 3877 (6.5%) missing values | Missing |
scheme_name has 28166 (47.4%) missing values | Missing |
permit has 3056 (5.1%) missing values | Missing |
amount_tsh is highly skewed (γ1 = 57.80779995) | Skewed |
num_private is highly skewed (γ1 = 91.93374999) | Skewed |
id has unique values | Unique |
amount_tsh has 41639 (70.1%) zeros | Zeros |
gps_height has 20438 (34.4%) zeros | Zeros |
longitude has 1812 (3.1%) zeros | Zeros |
num_private has 58643 (98.7%) zeros | Zeros |
population has 21381 (36.0%) zeros | Zeros |
construction_year has 20709 (34.9%) zeros | Zeros |
| Distinct count | 59400 |
|---|---|
| Unique (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 37115.131767676765 |
|---|---|
| Minimum | 0 |
| Maximum | 74247 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Memory size | 3.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 3730.9 |
| Q1 | 18519.75 |
| median | 37061.5 |
| Q3 | 55656.5 |
| 95-th percentile | 70564.05 |
| Maximum | 74247 |
| Range | 74247 |
| Interquartile range (IQR) | 37136.75 |
Descriptive statistics
| Standard deviation | 21453.12837 |
|---|---|
| Coefficient of variation (CV) | 0.5780156866 |
| Kurtosis | -1.201515029 |
| Mean | 37115.13177 |
| Median Absolute Deviation (MAD) | 18568.5 |
| Skewness | 0.00262253035 |
| Sum | 2204638827 |
| Variance | 460236716.9 |
| Value | Count | Frequency (%) | |
| 2047 | 1 | < 0.1% | |
| 72310 | 1 | < 0.1% | |
| 49805 | 1 | < 0.1% | |
| 51852 | 1 | < 0.1% | |
| 62091 | 1 | < 0.1% | |
| 64138 | 1 | < 0.1% | |
| 57993 | 1 | < 0.1% | |
| 60040 | 1 | < 0.1% | |
| 33413 | 1 | < 0.1% | |
| 35460 | 1 | < 0.1% | |
| 45699 | 1 | < 0.1% | |
| 41601 | 1 | < 0.1% | |
| 43648 | 1 | < 0.1% | |
| 70263 | 1 | < 0.1% | |
| 68212 | 1 | < 0.1% | |
| 20442 | 1 | < 0.1% | |
| 23134 | 1 | < 0.1% | |
| 19036 | 1 | < 0.1% | |
| 29275 | 1 | < 0.1% | |
| 25177 | 1 | < 0.1% | |
| 27224 | 1 | < 0.1% | |
| 4695 | 1 | < 0.1% | |
| 6742 | 1 | < 0.1% | |
| 597 | 1 | < 0.1% | |
| 2644 | 1 | < 0.1% | |
| Other values (59375) | 59375 | > 99.9% |
| Value | Count | Frequency (%) | |
| 0 | 1 | < 0.1% | |
| 1 | 1 | < 0.1% | |
| 2 | 1 | < 0.1% | |
| 3 | 1 | < 0.1% | |
| 4 | 1 | < 0.1% | |
| 5 | 1 | < 0.1% | |
| 6 | 1 | < 0.1% | |
| 7 | 1 | < 0.1% | |
| 8 | 1 | < 0.1% | |
| 9 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 74247 | 1 | < 0.1% | |
| 74246 | 1 | < 0.1% | |
| 74243 | 1 | < 0.1% | |
| 74242 | 1 | < 0.1% | |
| 74240 | 1 | < 0.1% | |
| 74239 | 1 | < 0.1% | |
| 74238 | 1 | < 0.1% | |
| 74237 | 1 | < 0.1% | |
| 74236 | 1 | < 0.1% | |
| 74235 | 1 | < 0.1% |
| Distinct count | 98 |
|---|---|
| Unique (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 317.6503846801347 |
|---|---|
| Minimum | 0.0 |
| Maximum | 350000.0 |
| Zeros | 41639 |
| Zeros (%) | 70.1% |
| Memory size | 928.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 20 |
| 95-th percentile | 1200 |
| Maximum | 350000 |
| Range | 350000 |
| Interquartile range (IQR) | 20 |
Descriptive statistics
| Standard deviation | 2997.574558 |
|---|---|
| Coefficient of variation (CV) | 9.436709989 |
| Kurtosis | 4903.543102 |
| Mean | 317.6503847 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 57.80779995 |
| Sum | 18868432.85 |
| Variance | 8985453.232 |
| Value | Count | Frequency (%) | |
| 0 | 41639 | 70.1% | |
| 500 | 3102 | 5.2% | |
| 50 | 2472 | 4.2% | |
| 1000 | 1488 | 2.5% | |
| 20 | 1463 | 2.5% | |
| 200 | 1220 | 2.1% | |
| 100 | 816 | 1.4% | |
| 10 | 806 | 1.4% | |
| 30 | 743 | 1.3% | |
| 2000 | 704 | 1.2% | |
| 250 | 569 | 1.0% | |
| 300 | 557 | 0.9% | |
| 5000 | 450 | 0.8% | |
| 5 | 376 | 0.6% | |
| 25 | 356 | 0.6% | |
| 3000 | 334 | 0.6% | |
| 1200 | 267 | 0.4% | |
| 1500 | 197 | 0.3% | |
| 6 | 190 | 0.3% | |
| 600 | 176 | 0.3% | |
| 4000 | 156 | 0.3% | |
| 2400 | 145 | 0.2% | |
| 2500 | 139 | 0.2% | |
| 6000 | 125 | 0.2% | |
| 7 | 69 | 0.1% | |
| Other values (73) | 841 | 1.4% |
| Value | Count | Frequency (%) | |
| 0 | 41639 | 70.1% | |
| 0.2 | 3 | < 0.1% | |
| 0.25 | 1 | < 0.1% | |
| 1 | 3 | < 0.1% | |
| 2 | 13 | < 0.1% | |
| 5 | 376 | 0.6% | |
| 6 | 190 | 0.3% | |
| 7 | 69 | 0.1% | |
| 9 | 1 | < 0.1% | |
| 10 | 806 | 1.4% |
| Value | Count | Frequency (%) | |
| 350000 | 1 | < 0.1% | |
| 250000 | 1 | < 0.1% | |
| 200000 | 1 | < 0.1% | |
| 170000 | 1 | < 0.1% | |
| 138000 | 1 | < 0.1% | |
| 120000 | 1 | < 0.1% | |
| 117000 | 7 | < 0.1% | |
| 100000 | 3 | < 0.1% | |
| 70000 | 1 | < 0.1% | |
| 60000 | 1 | < 0.1% |
| Distinct count | 356 |
|---|---|
| Unique (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 928.1 KiB |
| 2011-03-15 | 572 |
|---|---|
| 2011-03-17 | 558 |
| 2013-02-03 | 546 |
| 2011-03-14 | 520 |
| 2011-03-16 | 513 |
| Other values (351) |
| Value | Count | Frequency (%) | |
| 2011-03-15 | 572 | 1.0% | |
| 2011-03-17 | 558 | 0.9% | |
| 2013-02-03 | 546 | 0.9% | |
| 2011-03-14 | 520 | 0.9% | |
| 2011-03-16 | 513 | 0.9% | |
| 2011-03-18 | 497 | 0.8% | |
| 2011-03-19 | 466 | 0.8% | |
| 2013-02-04 | 464 | 0.8% | |
| 2013-01-29 | 459 | 0.8% | |
| 2011-03-04 | 458 | 0.8% | |
| 2013-02-14 | 444 | 0.7% | |
| 2013-01-24 | 435 | 0.7% | |
| 2011-03-05 | 434 | 0.7% | |
| 2013-02-15 | 429 | 0.7% | |
| 2013-03-15 | 428 | 0.7% | |
| 2011-03-11 | 426 | 0.7% | |
| 2013-01-30 | 421 | 0.7% | |
| 2013-02-16 | 418 | 0.7% | |
| 2011-03-23 | 417 | 0.7% | |
| 2011-03-09 | 416 | 0.7% | |
| 2013-01-18 | 409 | 0.7% | |
| 2011-03-30 | 391 | 0.7% | |
| 2013-02-26 | 391 | 0.7% | |
| 2013-03-19 | 381 | 0.6% | |
| 2011-03-24 | 381 | 0.6% | |
| Other values (331) | 48126 | 81.0% |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Most occurring characters
| Value | Count | Frequency (%) | |
| 0 | 139059 | 23.4% | |
| 1 | 129012 | 21.7% | |
| - | 118800 | 20.0% | |
| 2 | 103867 | 17.5% | |
| 3 | 52820 | 8.9% | |
| 7 | 12853 | 2.2% | |
| 4 | 10712 | 1.8% | |
| 8 | 9363 | 1.6% | |
| 6 | 6154 | 1.0% | |
| 5 | 6034 | 1.0% | |
| 9 | 5326 | 0.9% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Decimal Number | 475200 | 80.0% | |
| Dash Punctuation | 118800 | 20.0% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 0 | 139059 | 29.3% | |
| 1 | 129012 | 27.1% | |
| 2 | 103867 | 21.9% | |
| 3 | 52820 | 11.1% | |
| 7 | 12853 | 2.7% | |
| 4 | 10712 | 2.3% | |
| 8 | 9363 | 2.0% | |
| 6 | 6154 | 1.3% | |
| 5 | 6034 | 1.3% | |
| 9 | 5326 | 1.1% |
Most frequent Dash Punctuation characters
| Value | Count | Frequency (%) | |
| - | 118800 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Common | 594000 | 100.0% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 0 | 139059 | 23.4% | |
| 1 | 129012 | 21.7% | |
| - | 118800 | 20.0% | |
| 2 | 103867 | 17.5% | |
| 3 | 52820 | 8.9% | |
| 7 | 12853 | 2.2% | |
| 4 | 10712 | 1.8% | |
| 8 | 9363 | 1.6% | |
| 6 | 6154 | 1.0% | |
| 5 | 6034 | 1.0% | |
| 9 | 5326 | 0.9% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 594000 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| 0 | 139059 | 23.4% | |
| 1 | 129012 | 21.7% | |
| - | 118800 | 20.0% | |
| 2 | 103867 | 17.5% | |
| 3 | 52820 | 8.9% | |
| 7 | 12853 | 2.2% | |
| 4 | 10712 | 1.8% | |
| 8 | 9363 | 1.6% | |
| 6 | 6154 | 1.0% | |
| 5 | 6034 | 1.0% | |
| 9 | 5326 | 0.9% |
| Distinct count | 1897 |
|---|---|
| Unique (%) | 3.4% |
| Missing | 3635 |
| Missing (%) | 6.1% |
| Memory size | 928.1 KiB |
| Government Of Tanzania | |
|---|---|
| Danida | 3114 |
| Hesawa | 2202 |
| Rwssp | 1374 |
| World Bank | 1349 |
| Other values (1892) |
| Value | Count | Frequency (%) | |
| Government Of Tanzania | 9084 | 15.3% | |
| Danida | 3114 | 5.2% | |
| Hesawa | 2202 | 3.7% | |
| Rwssp | 1374 | 2.3% | |
| World Bank | 1349 | 2.3% | |
| Kkkt | 1287 | 2.2% | |
| World Vision | 1246 | 2.1% | |
| Unicef | 1057 | 1.8% | |
| Tasaf | 877 | 1.5% | |
| District Council | 843 | 1.4% | |
| Dhv | 829 | 1.4% | |
| Private Individual | 826 | 1.4% | |
| Dwsp | 811 | 1.4% | |
| 0 | 777 | 1.3% | |
| Norad | 765 | 1.3% | |
| Germany Republi | 610 | 1.0% | |
| Tcrs | 602 | 1.0% | |
| Ministry Of Water | 590 | 1.0% | |
| Water | 583 | 1.0% | |
| Dwe | 484 | 0.8% | |
| Netherlands | 470 | 0.8% | |
| Hifab | 450 | 0.8% | |
| Adb | 448 | 0.8% | |
| Lga | 442 | 0.7% | |
| Amref | 425 | 0.7% | |
| Other values (1872) | 24220 | 40.8% | |
| (Missing) | 3635 | 6.1% |
Length
| Max length | 30 |
|---|---|
| Median length | 6 |
| Mean length | 9.505824916 |
| Min length | 1 |
Most occurring characters
| Value | Count | Frequency (%) | |
| a | 71835 | 12.7% | |
| n | 65112 | 11.5% | |
| i | 38011 | 6.7% | |
| e | 37464 | 6.6% | |
| 34673 | 6.1% | ||
| r | 27879 | 4.9% | |
| t | 23016 | 4.1% | |
| o | 22741 | 4.0% | |
| s | 17208 | 3.0% | |
| d | 15464 | 2.7% | |
| f | 15329 | 2.7% | |
| m | 15140 | 2.7% | |
| v | 13030 | 2.3% | |
| T | 12110 | 2.1% | |
| l | 11219 | 2.0% | |
| G | 10722 | 1.9% | |
| O | 10613 | 1.9% | |
| z | 9687 | 1.7% | |
| c | 9216 | 1.6% | |
| w | 7971 | 1.4% | |
| D | 7928 | 1.4% | |
| u | 7884 | 1.4% | |
| W | 7352 | 1.3% | |
| p | 6992 | 1.2% | |
| k | 6496 | 1.2% | |
| Other values (44) | 59554 | 10.5% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 436785 | 77.4% | |
| Uppercase Letter | 89705 | 15.9% | |
| Space Separator | 34673 | 6.1% | |
| Other Punctuation | 1322 | 0.2% | |
| Decimal Number | 803 | 0.1% | |
| Open Punctuation | 437 | 0.1% | |
| Close Punctuation | 431 | 0.1% | |
| Dash Punctuation | 323 | 0.1% | |
| Connector Punctuation | 167 | < 0.1% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| T | 12110 | 13.5% | |
| G | 10722 | 12.0% | |
| O | 10613 | 11.8% | |
| D | 7928 | 8.8% | |
| W | 7352 | 8.2% | |
| C | 4679 | 5.2% | |
| R | 4454 | 5.0% | |
| H | 3462 | 3.9% | |
| M | 3135 | 3.5% | |
| K | 2962 | 3.3% | |
| A | 2920 | 3.3% | |
| S | 2653 | 3.0% | |
| I | 2471 | 2.8% | |
| B | 2057 | 2.3% | |
| N | 2026 | 2.3% | |
| P | 1984 | 2.2% | |
| U | 1877 | 2.1% | |
| V | 1795 | 2.0% | |
| L | 1472 | 1.6% | |
| F | 1386 | 1.5% | |
| J | 842 | 0.9% | |
| E | 444 | 0.5% | |
| Y | 233 | 0.3% | |
| Q | 111 | 0.1% | |
| Z | 16 | < 0.1% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| a | 71835 | 16.4% | |
| n | 65112 | 14.9% | |
| i | 38011 | 8.7% | |
| e | 37464 | 8.6% | |
| r | 27879 | 6.4% | |
| t | 23016 | 5.3% | |
| o | 22741 | 5.2% | |
| s | 17208 | 3.9% | |
| d | 15464 | 3.5% | |
| f | 15329 | 3.5% | |
| m | 15140 | 3.5% | |
| v | 13030 | 3.0% | |
| l | 11219 | 2.6% | |
| z | 9687 | 2.2% | |
| c | 9216 | 2.1% | |
| w | 7971 | 1.8% | |
| u | 7884 | 1.8% | |
| p | 6992 | 1.6% | |
| k | 6496 | 1.5% | |
| h | 5694 | 1.3% | |
| g | 3074 | 0.7% | |
| b | 2735 | 0.6% | |
| y | 2679 | 0.6% | |
| x | 565 | 0.1% | |
| j | 313 | 0.1% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 34673 | 100.0% |
Most frequent Open Punctuation characters
| Value | Count | Frequency (%) | |
| ( | 434 | 99.3% | |
| [ | 3 | 0.7% |
Most frequent Close Punctuation characters
| Value | Count | Frequency (%) | |
| ) | 429 | 99.5% | |
| ] | 2 | 0.5% |
Most frequent Other Punctuation characters
| Value | Count | Frequency (%) | |
| / | 783 | 59.2% | |
| . | 469 | 35.5% | |
| \ | 33 | 2.5% | |
| & | 26 | 2.0% | |
| ' | 11 | 0.8% |
Most frequent Connector Punctuation characters
| Value | Count | Frequency (%) | |
| _ | 167 | 100.0% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 0 | 793 | 98.8% | |
| 2 | 5 | 0.6% | |
| 1 | 2 | 0.2% | |
| 9 | 2 | 0.2% | |
| 4 | 1 | 0.1% |
Most frequent Dash Punctuation characters
| Value | Count | Frequency (%) | |
| - | 323 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 526490 | 93.2% | |
| Common | 38156 | 6.8% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| a | 71835 | 13.6% | |
| n | 65112 | 12.4% | |
| i | 38011 | 7.2% | |
| e | 37464 | 7.1% | |
| r | 27879 | 5.3% | |
| t | 23016 | 4.4% | |
| o | 22741 | 4.3% | |
| s | 17208 | 3.3% | |
| d | 15464 | 2.9% | |
| f | 15329 | 2.9% | |
| m | 15140 | 2.9% | |
| v | 13030 | 2.5% | |
| T | 12110 | 2.3% | |
| l | 11219 | 2.1% | |
| G | 10722 | 2.0% | |
| O | 10613 | 2.0% | |
| z | 9687 | 1.8% | |
| c | 9216 | 1.8% | |
| w | 7971 | 1.5% | |
| D | 7928 | 1.5% | |
| u | 7884 | 1.5% | |
| W | 7352 | 1.4% | |
| p | 6992 | 1.3% | |
| k | 6496 | 1.2% | |
| h | 5694 | 1.1% | |
| Other values (27) | 50377 | 9.6% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 34673 | 90.9% | ||
| 0 | 793 | 2.1% | |
| / | 783 | 2.1% | |
| . | 469 | 1.2% | |
| ( | 434 | 1.1% | |
| ) | 429 | 1.1% | |
| - | 323 | 0.8% | |
| _ | 167 | 0.4% | |
| \ | 33 | 0.1% | |
| & | 26 | 0.1% | |
| ' | 11 | < 0.1% | |
| 2 | 5 | < 0.1% | |
| [ | 3 | < 0.1% | |
| 1 | 2 | < 0.1% | |
| ] | 2 | < 0.1% | |
| 9 | 2 | < 0.1% | |
| 4 | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 564646 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| a | 71835 | 12.7% | |
| n | 65112 | 11.5% | |
| i | 38011 | 6.7% | |
| e | 37464 | 6.6% | |
| 34673 | 6.1% | ||
| r | 27879 | 4.9% | |
| t | 23016 | 4.1% | |
| o | 22741 | 4.0% | |
| s | 17208 | 3.0% | |
| d | 15464 | 2.7% | |
| f | 15329 | 2.7% | |
| m | 15140 | 2.7% | |
| v | 13030 | 2.3% | |
| T | 12110 | 2.1% | |
| l | 11219 | 2.0% | |
| G | 10722 | 1.9% | |
| O | 10613 | 1.9% | |
| z | 9687 | 1.7% | |
| c | 9216 | 1.6% | |
| w | 7971 | 1.4% | |
| D | 7928 | 1.4% | |
| u | 7884 | 1.4% | |
| W | 7352 | 1.3% | |
| p | 6992 | 1.2% | |
| k | 6496 | 1.2% | |
| Other values (44) | 59554 | 10.5% |
| Distinct count | 2428 |
|---|---|
| Unique (%) | 4.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 668.297239057239 |
|---|---|
| Minimum | -90 |
| Maximum | 2770 |
| Zeros | 20438 |
| Zeros (%) | 34.4% |
| Memory size | 3.4 MiB |
Quantile statistics
| Minimum | -90 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 369 |
| Q3 | 1319.25 |
| 95-th percentile | 1797 |
| Maximum | 2770 |
| Range | 2860 |
| Interquartile range (IQR) | 1319.25 |
Descriptive statistics
| Standard deviation | 693.1163503 |
|---|---|
| Coefficient of variation (CV) | 1.037137833 |
| Kurtosis | -1.292440135 |
| Mean | 668.2972391 |
| Median Absolute Deviation (MAD) | 369 |
| Skewness | 0.462402085 |
| Sum | 39696856 |
| Variance | 480410.2751 |
| Value | Count | Frequency (%) | |
| 0 | 20438 | 34.4% | |
| -15 | 60 | 0.1% | |
| -16 | 55 | 0.1% | |
| -13 | 55 | 0.1% | |
| -20 | 52 | 0.1% | |
| 1290 | 52 | 0.1% | |
| -14 | 51 | 0.1% | |
| 303 | 51 | 0.1% | |
| -18 | 49 | 0.1% | |
| -19 | 47 | 0.1% | |
| 1269 | 46 | 0.1% | |
| 1295 | 46 | 0.1% | |
| 1304 | 45 | 0.1% | |
| -23 | 45 | 0.1% | |
| 280 | 44 | 0.1% | |
| 1538 | 44 | 0.1% | |
| 1286 | 44 | 0.1% | |
| -8 | 44 | 0.1% | |
| -17 | 44 | 0.1% | |
| 1332 | 43 | 0.1% | |
| 320 | 43 | 0.1% | |
| 1317 | 42 | 0.1% | |
| 1293 | 42 | 0.1% | |
| 1319 | 42 | 0.1% | |
| 1359 | 42 | 0.1% | |
| Other values (2403) | 37834 | 63.7% |
| Value | Count | Frequency (%) | |
| -90 | 1 | < 0.1% | |
| -63 | 2 | < 0.1% | |
| -59 | 1 | < 0.1% | |
| -57 | 1 | < 0.1% | |
| -55 | 1 | < 0.1% | |
| -54 | 1 | < 0.1% | |
| -53 | 1 | < 0.1% | |
| -52 | 2 | < 0.1% | |
| -51 | 2 | < 0.1% | |
| -50 | 5 | < 0.1% |
| Value | Count | Frequency (%) | |
| 2770 | 1 | < 0.1% | |
| 2628 | 1 | < 0.1% | |
| 2627 | 1 | < 0.1% | |
| 2626 | 2 | < 0.1% | |
| 2623 | 1 | < 0.1% | |
| 2614 | 1 | < 0.1% | |
| 2585 | 1 | < 0.1% | |
| 2576 | 1 | < 0.1% | |
| 2569 | 1 | < 0.1% | |
| 2568 | 1 | < 0.1% |
| Distinct count | 2145 |
|---|---|
| Unique (%) | 3.8% |
| Missing | 3655 |
| Missing (%) | 6.2% |
| Memory size | 3.4 MiB |
| DWE | |
|---|---|
| Government | 1825 |
| RWE | 1206 |
| Commu | 1060 |
| DANIDA | 1050 |
| Other values (2140) |
| Value | Count | Frequency (%) | |
| DWE | 17402 | 29.3% | |
| Government | 1825 | 3.1% | |
| RWE | 1206 | 2.0% | |
| Commu | 1060 | 1.8% | |
| DANIDA | 1050 | 1.8% | |
| KKKT | 898 | 1.5% | |
| Hesawa | 840 | 1.4% | |
| 0 | 777 | 1.3% | |
| TCRS | 707 | 1.2% | |
| Central government | 622 | 1.0% | |
| CES | 610 | 1.0% | |
| Community | 553 | 0.9% | |
| DANID | 552 | 0.9% | |
| District Council | 551 | 0.9% | |
| HESAWA | 539 | 0.9% | |
| World vision | 408 | 0.7% | |
| LGA | 408 | 0.7% | |
| WEDECO | 397 | 0.7% | |
| TASAF | 396 | 0.7% | |
| District council | 392 | 0.7% | |
| Gover | 383 | 0.6% | |
| AMREF | 329 | 0.6% | |
| TWESA | 316 | 0.5% | |
| WU | 301 | 0.5% | |
| Dmdd | 287 | 0.5% | |
| Other values (2120) | 22936 | 38.6% | |
| (Missing) | 3655 | 6.2% |
Length
| Max length | 30 |
|---|---|
| Median length | 4 |
| Mean length | 5.91976431 |
| Min length | 1 |
Most occurring characters
| Value | Count | Frequency (%) | |
| D | 27595 | 7.8% | |
| W | 25849 | 7.4% | |
| E | 25389 | 7.2% | |
| n | 23868 | 6.8% | |
| a | 20998 | 6.0% | |
| e | 15500 | 4.4% | |
| i | 15053 | 4.3% | |
| A | 13668 | 3.9% | |
| r | 13377 | 3.8% | |
| t | 12904 | 3.7% | |
| 12673 | 3.6% | ||
| o | 12398 | 3.5% | |
| C | 10535 | 3.0% | |
| m | 9289 | 2.6% | |
| S | 6659 | 1.9% | |
| R | 6518 | 1.9% | |
| l | 6201 | 1.8% | |
| s | 6173 | 1.8% | |
| I | 6160 | 1.8% | |
| T | 5948 | 1.7% | |
| u | 5436 | 1.5% | |
| K | 5390 | 1.5% | |
| c | 4835 | 1.4% | |
| N | 4674 | 1.3% | |
| G | 4466 | 1.3% | |
| Other values (45) | 50078 | 14.2% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 169155 | 48.1% | |
| Uppercase Letter | 167438 | 47.6% | |
| Space Separator | 12673 | 3.6% | |
| Other Punctuation | 971 | 0.3% | |
| Decimal Number | 783 | 0.2% | |
| Dash Punctuation | 268 | 0.1% | |
| Connector Punctuation | 169 | < 0.1% | |
| Open Punctuation | 159 | < 0.1% | |
| Close Punctuation | 16 | < 0.1% | |
| Currency Symbol | 2 | < 0.1% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| D | 27595 | 16.5% | |
| W | 25849 | 15.4% | |
| E | 25389 | 15.2% | |
| A | 13668 | 8.2% | |
| C | 10535 | 6.3% | |
| S | 6659 | 4.0% | |
| R | 6518 | 3.9% | |
| I | 6160 | 3.7% | |
| T | 5948 | 3.6% | |
| K | 5390 | 3.2% | |
| N | 4674 | 2.8% | |
| G | 4466 | 2.7% | |
| M | 4257 | 2.5% | |
| H | 3455 | 2.1% | |
| O | 3149 | 1.9% | |
| F | 3109 | 1.9% | |
| L | 2509 | 1.5% | |
| U | 2228 | 1.3% | |
| P | 1951 | 1.2% | |
| V | 1583 | 0.9% | |
| B | 796 | 0.5% | |
| J | 762 | 0.5% | |
| X | 356 | 0.2% | |
| Y | 245 | 0.1% | |
| Z | 129 | 0.1% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| n | 23868 | 14.1% | |
| a | 20998 | 12.4% | |
| e | 15500 | 9.2% | |
| i | 15053 | 8.9% | |
| r | 13377 | 7.9% | |
| t | 12904 | 7.6% | |
| o | 12398 | 7.3% | |
| m | 9289 | 5.5% | |
| l | 6201 | 3.7% | |
| s | 6173 | 3.6% | |
| u | 5436 | 3.2% | |
| c | 4835 | 2.9% | |
| v | 4433 | 2.6% | |
| d | 4210 | 2.5% | |
| w | 3338 | 2.0% | |
| g | 2679 | 1.6% | |
| y | 1794 | 1.1% | |
| h | 1702 | 1.0% | |
| p | 1434 | 0.8% | |
| k | 1393 | 0.8% | |
| f | 802 | 0.5% | |
| b | 505 | 0.3% | |
| j | 482 | 0.3% | |
| z | 323 | 0.2% | |
| x | 14 | < 0.1% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 12673 | 100.0% |
Most frequent Connector Punctuation characters
| Value | Count | Frequency (%) | |
| _ | 169 | 100.0% |
Most frequent Other Punctuation characters
| Value | Count | Frequency (%) | |
| / | 670 | 69.0% | |
| . | 238 | 24.5% | |
| & | 50 | 5.1% | |
| ' | 12 | 1.2% | |
| # | 1 | 0.1% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 0 | 780 | 99.6% | |
| 1 | 1 | 0.1% | |
| 4 | 1 | 0.1% | |
| 9 | 1 | 0.1% |
Most frequent Dash Punctuation characters
| Value | Count | Frequency (%) | |
| - | 268 | 100.0% |
Most frequent Open Punctuation characters
| Value | Count | Frequency (%) | |
| ( | 157 | 98.7% | |
| [ | 2 | 1.3% |
Most frequent Close Punctuation characters
| Value | Count | Frequency (%) | |
| } | 13 | 81.2% | |
| ] | 2 | 12.5% | |
| ) | 1 | 6.2% |
Most frequent Currency Symbol characters
| Value | Count | Frequency (%) | |
| $ | 2 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 336593 | 95.7% | |
| Common | 15041 | 4.3% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| D | 27595 | 8.2% | |
| W | 25849 | 7.7% | |
| E | 25389 | 7.5% | |
| n | 23868 | 7.1% | |
| a | 20998 | 6.2% | |
| e | 15500 | 4.6% | |
| i | 15053 | 4.5% | |
| A | 13668 | 4.1% | |
| r | 13377 | 4.0% | |
| t | 12904 | 3.8% | |
| o | 12398 | 3.7% | |
| C | 10535 | 3.1% | |
| m | 9289 | 2.8% | |
| S | 6659 | 2.0% | |
| R | 6518 | 1.9% | |
| l | 6201 | 1.8% | |
| s | 6173 | 1.8% | |
| I | 6160 | 1.8% | |
| T | 5948 | 1.8% | |
| u | 5436 | 1.6% | |
| K | 5390 | 1.6% | |
| c | 4835 | 1.4% | |
| N | 4674 | 1.4% | |
| G | 4466 | 1.3% | |
| v | 4433 | 1.3% | |
| Other values (27) | 43277 | 12.9% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 12673 | 84.3% | ||
| 0 | 780 | 5.2% | |
| / | 670 | 4.5% | |
| - | 268 | 1.8% | |
| . | 238 | 1.6% | |
| _ | 169 | 1.1% | |
| ( | 157 | 1.0% | |
| & | 50 | 0.3% | |
| } | 13 | 0.1% | |
| ' | 12 | 0.1% | |
| $ | 2 | < 0.1% | |
| [ | 2 | < 0.1% | |
| ] | 2 | < 0.1% | |
| ) | 1 | < 0.1% | |
| 1 | 1 | < 0.1% | |
| # | 1 | < 0.1% | |
| 4 | 1 | < 0.1% | |
| 9 | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 351634 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| D | 27595 | 7.8% | |
| W | 25849 | 7.4% | |
| E | 25389 | 7.2% | |
| n | 23868 | 6.8% | |
| a | 20998 | 6.0% | |
| e | 15500 | 4.4% | |
| i | 15053 | 4.3% | |
| A | 13668 | 3.9% | |
| r | 13377 | 3.8% | |
| t | 12904 | 3.7% | |
| 12673 | 3.6% | ||
| o | 12398 | 3.5% | |
| C | 10535 | 3.0% | |
| m | 9289 | 2.6% | |
| S | 6659 | 1.9% | |
| R | 6518 | 1.9% | |
| l | 6201 | 1.8% | |
| s | 6173 | 1.8% | |
| I | 6160 | 1.8% | |
| T | 5948 | 1.7% | |
| u | 5436 | 1.5% | |
| K | 5390 | 1.5% | |
| c | 4835 | 1.4% | |
| N | 4674 | 1.3% | |
| G | 4466 | 1.3% | |
| Other values (45) | 50078 | 14.2% |
| Distinct count | 57516 |
|---|---|
| Unique (%) | 96.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 34.077426692028794 |
|---|---|
| Minimum | 0.0 |
| Maximum | 40.34519307 |
| Zeros | 1812 |
| Zeros (%) | 3.1% |
| Memory size | 3.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 30.04066001 |
| Q1 | 33.09034738 |
| median | 34.90874343 |
| Q3 | 37.17838657 |
| 95-th percentile | 39.13323954 |
| Maximum | 40.34519307 |
| Range | 40.34519307 |
| Interquartile range (IQR) | 4.08803919 |
Descriptive statistics
| Standard deviation | 6.567431846 |
|---|---|
| Coefficient of variation (CV) | 0.1927208854 |
| Kurtosis | 19.18703105 |
| Mean | 34.07742669 |
| Median Absolute Deviation (MAD) | 2.032511095 |
| Skewness | -4.191046455 |
| Sum | 2024199.146 |
| Variance | 43.13116105 |
| Value | Count | Frequency (%) | |
| 0 | 1812 | 3.1% | |
| 37.54090064 | 2 | < 0.1% | |
| 33.01050977 | 2 | < 0.1% | |
| 39.09348389 | 2 | < 0.1% | |
| 32.9727187 | 2 | < 0.1% | |
| 33.00627548 | 2 | < 0.1% | |
| 39.10395018 | 2 | < 0.1% | |
| 37.54278497 | 2 | < 0.1% | |
| 36.80248988 | 2 | < 0.1% | |
| 39.09837398 | 2 | < 0.1% | |
| 33.09034738 | 2 | < 0.1% | |
| 33.00503158 | 2 | < 0.1% | |
| 32.9780624 | 2 | < 0.1% | |
| 39.08887513 | 2 | < 0.1% | |
| 31.61952953 | 2 | < 0.1% | |
| 39.09309544 | 2 | < 0.1% | |
| 39.10530661 | 2 | < 0.1% | |
| 32.93668943 | 2 | < 0.1% | |
| 32.98751118 | 2 | < 0.1% | |
| 39.09087979 | 2 | < 0.1% | |
| 37.31425027 | 2 | < 0.1% | |
| 32.98478963 | 2 | < 0.1% | |
| 39.09143391 | 2 | < 0.1% | |
| 37.27435243 | 2 | < 0.1% | |
| 32.91986139 | 2 | < 0.1% | |
| Other values (57491) | 57540 | 96.9% |
| Value | Count | Frequency (%) | |
| 0 | 1812 | 3.1% | |
| 29.6071219 | 1 | < 0.1% | |
| 29.60720109 | 1 | < 0.1% | |
| 29.61032056 | 1 | < 0.1% | |
| 29.61096482 | 1 | < 0.1% | |
| 29.61194674 | 1 | < 0.1% | |
| 29.61250689 | 1 | < 0.1% | |
| 29.61276296 | 1 | < 0.1% | |
| 29.61344309 | 1 | < 0.1% | |
| 29.6168718 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 40.34519307 | 1 | < 0.1% | |
| 40.34430089 | 1 | < 0.1% | |
| 40.32523996 | 1 | < 0.1% | |
| 40.32522643 | 1 | < 0.1% | |
| 40.32340181 | 1 | < 0.1% | |
| 40.32283237 | 1 | < 0.1% | |
| 40.32280453 | 1 | < 0.1% | |
| 40.3226251 | 1 | < 0.1% | |
| 40.32216902 | 1 | < 0.1% | |
| 40.32196593 | 1 | < 0.1% |
latitude
Real number (ℝ)
| Distinct count | 57517 |
|---|---|
| Unique (%) | 96.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -5.706032659626431 |
|---|---|
| Minimum | -11.64944018 |
| Maximum | -2e-08 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 3.4 MiB |
Quantile statistics
| Minimum | -11.64944018 |
|---|---|
| 5-th percentile | -10.58554992 |
| Q1 | -8.540621305 |
| median | -5.02159665 |
| Q3 | -3.32615564 |
| 95-th percentile | -1.408872227 |
| Maximum | -2e-08 |
| Range | 11.64944016 |
| Interquartile range (IQR) | 5.214465665 |
Descriptive statistics
| Standard deviation | 2.946019081 |
|---|---|
| Coefficient of variation (CV) | -0.5162990219 |
| Kurtosis | -1.057616666 |
| Mean | -5.70603266 |
| Median Absolute Deviation (MAD) | 2.07002988 |
| Skewness | -0.1520365709 |
| Sum | -338938.34 |
| Variance | 8.679028427 |
| Value | Count | Frequency (%) | |
| -2e-08 | 1812 | 3.1% | |
| -6.98584173 | 2 | < 0.1% | |
| -3.79757861 | 2 | < 0.1% | |
| -6.98188419 | 2 | < 0.1% | |
| -7.10462503 | 2 | < 0.1% | |
| -7.05692253 | 2 | < 0.1% | |
| -7.17517443 | 2 | < 0.1% | |
| -6.99073094 | 2 | < 0.1% | |
| -6.9787555 | 2 | < 0.1% | |
| -6.99470401 | 2 | < 0.1% | |
| -2.49454559 | 2 | < 0.1% | |
| -6.9642576 | 2 | < 0.1% | |
| -2.50658954 | 2 | < 0.1% | |
| -6.99054864 | 2 | < 0.1% | |
| -2.48522658 | 2 | < 0.1% | |
| -2.4943533 | 2 | < 0.1% | |
| -6.96247516 | 2 | < 0.1% | |
| -6.98945622 | 2 | < 0.1% | |
| -6.95732845 | 2 | < 0.1% | |
| -6.95871592 | 2 | < 0.1% | |
| -6.99261144 | 2 | < 0.1% | |
| -6.99129411 | 2 | < 0.1% | |
| -7.17715478 | 2 | < 0.1% | |
| -2.50162744 | 2 | < 0.1% | |
| -1.793342 | 2 | < 0.1% | |
| Other values (57492) | 57540 | 96.9% |
| Value | Count | Frequency (%) | |
| -11.64944018 | 1 | < 0.1% | |
| -11.64837759 | 1 | < 0.1% | |
| -11.58629656 | 1 | < 0.1% | |
| -11.56857679 | 1 | < 0.1% | |
| -11.56680457 | 1 | < 0.1% | |
| -11.56450865 | 1 | < 0.1% | |
| -11.56432357 | 1 | < 0.1% | |
| -11.56231592 | 1 | < 0.1% | |
| -11.56228898 | 1 | < 0.1% | |
| -11.56161898 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| -2e-08 | 1812 | 3.1% | |
| -0.99846435 | 1 | < 0.1% | |
| -0.998916 | 1 | < 0.1% | |
| -0.99901209 | 1 | < 0.1% | |
| -0.99911702 | 1 | < 0.1% | |
| -0.9994692 | 1 | < 0.1% | |
| -0.99950651 | 1 | < 0.1% | |
| -0.99952232 | 1 | < 0.1% | |
| -1.00058519 | 1 | < 0.1% | |
| -1.0015208 | 1 | < 0.1% |
| Distinct count | 37400 |
|---|---|
| Unique (%) | 63.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.4 MiB |
| none | 3563 |
|---|---|
| Shuleni | 1748 |
| Zahanati | 830 |
| Msikitini | 535 |
| Kanisani | 323 |
| Other values (37395) |
| Value | Count | Frequency (%) | |
| none | 3563 | 6.0% | |
| Shuleni | 1748 | 2.9% | |
| Zahanati | 830 | 1.4% | |
| Msikitini | 535 | 0.9% | |
| Kanisani | 323 | 0.5% | |
| Bombani | 271 | 0.5% | |
| Sokoni | 260 | 0.4% | |
| Ofisini | 254 | 0.4% | |
| School | 208 | 0.4% | |
| Shule Ya Msingi | 199 | 0.3% | |
| Shule | 152 | 0.3% | |
| Sekondari | 146 | 0.2% | |
| Muungano | 133 | 0.2% | |
| Mkombozi | 111 | 0.2% | |
| Madukani | 104 | 0.2% | |
| Mbugani | 94 | 0.2% | |
| Hospital | 94 | 0.2% | |
| Upendo | 93 | 0.2% | |
| Kituo Cha Afya | 90 | 0.2% | |
| Mkuyuni | 88 | 0.1% | |
| Umoja | 84 | 0.1% | |
| Center | 83 | 0.1% | |
| Ccm | 81 | 0.1% | |
| Kisimani | 78 | 0.1% | |
| Ofisi Ya Kijiji | 76 | 0.1% | |
| Other values (37375) | 49702 | 83.7% |
Length
| Max length | 30 |
|---|---|
| Median length | 10 |
| Mean length | 10.96210438 |
| Min length | 1 |
Most occurring characters
| Value | Count | Frequency (%) | |
| a | 98806 | 15.2% | |
| i | 52404 | 8.0% | |
| 49898 | 7.7% | ||
| n | 42148 | 6.5% | |
| e | 40985 | 6.3% | |
| w | 31669 | 4.9% | |
| K | 31385 | 4.8% | |
| o | 30247 | 4.6% | |
| u | 24217 | 3.7% | |
| M | 22040 | 3.4% | |
| l | 20954 | 3.2% | |
| m | 17631 | 2.7% | |
| h | 17215 | 2.6% | |
| s | 16775 | 2.6% | |
| r | 14143 | 2.2% | |
| g | 13014 | 2.0% | |
| t | 11573 | 1.8% | |
| k | 11046 | 1.7% | |
| S | 10752 | 1.7% | |
| b | 10438 | 1.6% | |
| d | 10389 | 1.6% | |
| y | 7784 | 1.2% | |
| z | 6300 | 1.0% | |
| c | 5044 | 0.8% | |
| N | 4880 | 0.7% | |
| Other values (50) | 49412 | 7.6% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 493422 | 75.8% | |
| Uppercase Letter | 105185 | 16.2% | |
| Space Separator | 49898 | 7.7% | |
| Decimal Number | 1680 | 0.3% | |
| Other Punctuation | 741 | 0.1% | |
| Dash Punctuation | 104 | < 0.1% | |
| Open Punctuation | 37 | < 0.1% | |
| Close Punctuation | 37 | < 0.1% | |
| Connector Punctuation | 24 | < 0.1% | |
| Modifier Symbol | 21 | < 0.1% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| a | 98806 | 20.0% | |
| i | 52404 | 10.6% | |
| n | 42148 | 8.5% | |
| e | 40985 | 8.3% | |
| w | 31669 | 6.4% | |
| o | 30247 | 6.1% | |
| u | 24217 | 4.9% | |
| l | 20954 | 4.2% | |
| m | 17631 | 3.6% | |
| h | 17215 | 3.5% | |
| s | 16775 | 3.4% | |
| r | 14143 | 2.9% | |
| g | 13014 | 2.6% | |
| t | 11573 | 2.3% | |
| k | 11046 | 2.2% | |
| b | 10438 | 2.1% | |
| d | 10389 | 2.1% | |
| y | 7784 | 1.6% | |
| z | 6300 | 1.3% | |
| c | 5044 | 1.0% | |
| p | 3584 | 0.7% | |
| j | 3494 | 0.7% | |
| f | 2303 | 0.5% | |
| v | 1058 | 0.2% | |
| x | 127 | < 0.1% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| K | 31385 | 29.8% | |
| M | 22040 | 21.0% | |
| S | 10752 | 10.2% | |
| N | 4880 | 4.6% | |
| A | 3497 | 3.3% | |
| B | 3425 | 3.3% | |
| C | 2791 | 2.7% | |
| P | 2564 | 2.4% | |
| L | 2507 | 2.4% | |
| J | 2385 | 2.3% | |
| Y | 2005 | 1.9% | |
| T | 1926 | 1.8% | |
| I | 1851 | 1.8% | |
| H | 1623 | 1.5% | |
| R | 1620 | 1.5% | |
| Z | 1526 | 1.5% | |
| D | 1417 | 1.3% | |
| G | 1318 | 1.3% | |
| O | 1226 | 1.2% | |
| E | 1209 | 1.1% | |
| U | 1042 | 1.0% | |
| W | 910 | 0.9% | |
| F | 822 | 0.8% | |
| V | 404 | 0.4% | |
| Q | 53 | 0.1% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 49898 | 100.0% |
Most frequent Other Punctuation characters
| Value | Count | Frequency (%) | |
| ' | 417 | 56.3% | |
| . | 175 | 23.6% | |
| / | 146 | 19.7% | |
| & | 2 | 0.3% | |
| \ | 1 | 0.1% |
Most frequent Dash Punctuation characters
| Value | Count | Frequency (%) | |
| - | 104 | 100.0% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 1 | 507 | 30.2% | |
| 2 | 439 | 26.1% | |
| 3 | 152 | 9.0% | |
| 4 | 120 | 7.1% | |
| 7 | 106 | 6.3% | |
| 5 | 86 | 5.1% | |
| 6 | 80 | 4.8% | |
| 8 | 75 | 4.5% | |
| 9 | 70 | 4.2% | |
| 0 | 45 | 2.7% |
Most frequent Open Punctuation characters
| Value | Count | Frequency (%) | |
| ( | 29 | 78.4% | |
| [ | 8 | 21.6% |
Most frequent Close Punctuation characters
| Value | Count | Frequency (%) | |
| ) | 29 | 78.4% | |
| ] | 8 | 21.6% |
Most frequent Connector Punctuation characters
| Value | Count | Frequency (%) | |
| _ | 24 | 100.0% |
Most frequent Modifier Symbol characters
| Value | Count | Frequency (%) | |
| ` | 21 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 598607 | 91.9% | |
| Common | 52542 | 8.1% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| a | 98806 | 16.5% | |
| i | 52404 | 8.8% | |
| n | 42148 | 7.0% | |
| e | 40985 | 6.8% | |
| w | 31669 | 5.3% | |
| K | 31385 | 5.2% | |
| o | 30247 | 5.1% | |
| u | 24217 | 4.0% | |
| M | 22040 | 3.7% | |
| l | 20954 | 3.5% | |
| m | 17631 | 2.9% | |
| h | 17215 | 2.9% | |
| s | 16775 | 2.8% | |
| r | 14143 | 2.4% | |
| g | 13014 | 2.2% | |
| t | 11573 | 1.9% | |
| k | 11046 | 1.8% | |
| S | 10752 | 1.8% | |
| b | 10438 | 1.7% | |
| d | 10389 | 1.7% | |
| y | 7784 | 1.3% | |
| z | 6300 | 1.1% | |
| c | 5044 | 0.8% | |
| N | 4880 | 0.8% | |
| p | 3584 | 0.6% | |
| Other values (27) | 43184 | 7.2% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 49898 | 95.0% | ||
| 1 | 507 | 1.0% | |
| 2 | 439 | 0.8% | |
| ' | 417 | 0.8% | |
| . | 175 | 0.3% | |
| 3 | 152 | 0.3% | |
| / | 146 | 0.3% | |
| 4 | 120 | 0.2% | |
| 7 | 106 | 0.2% | |
| - | 104 | 0.2% | |
| 5 | 86 | 0.2% | |
| 6 | 80 | 0.2% | |
| 8 | 75 | 0.1% | |
| 9 | 70 | 0.1% | |
| 0 | 45 | 0.1% | |
| ( | 29 | 0.1% | |
| ) | 29 | 0.1% | |
| _ | 24 | < 0.1% | |
| ` | 21 | < 0.1% | |
| [ | 8 | < 0.1% | |
| ] | 8 | < 0.1% | |
| & | 2 | < 0.1% | |
| \ | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 651149 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| a | 98806 | 15.2% | |
| i | 52404 | 8.0% | |
| 49898 | 7.7% | ||
| n | 42148 | 6.5% | |
| e | 40985 | 6.3% | |
| w | 31669 | 4.9% | |
| K | 31385 | 4.8% | |
| o | 30247 | 4.6% | |
| u | 24217 | 3.7% | |
| M | 22040 | 3.4% | |
| l | 20954 | 3.2% | |
| m | 17631 | 2.7% | |
| h | 17215 | 2.6% | |
| s | 16775 | 2.6% | |
| r | 14143 | 2.2% | |
| g | 13014 | 2.0% | |
| t | 11573 | 1.8% | |
| k | 11046 | 1.7% | |
| S | 10752 | 1.7% | |
| b | 10438 | 1.6% | |
| d | 10389 | 1.6% | |
| y | 7784 | 1.2% | |
| z | 6300 | 1.0% | |
| c | 5044 | 0.8% | |
| N | 4880 | 0.7% | |
| Other values (50) | 49412 | 7.6% |
| Distinct count | 65 |
|---|---|
| Unique (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.47414141414141414 |
|---|---|
| Minimum | 0 |
| Maximum | 1776 |
| Zeros | 58643 |
| Zeros (%) | 98.7% |
| Memory size | 3.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 1776 |
| Range | 1776 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 12.23622981 |
|---|---|
| Coefficient of variation (CV) | 25.80713147 |
| Kurtosis | 11137.29521 |
| Mean | 0.4741414141 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 91.93374999 |
| Sum | 28164 |
| Variance | 149.72532 |
| Value | Count | Frequency (%) | |
| 0 | 58643 | 98.7% | |
| 6 | 81 | 0.1% | |
| 1 | 73 | 0.1% | |
| 5 | 46 | 0.1% | |
| 8 | 46 | 0.1% | |
| 32 | 40 | 0.1% | |
| 45 | 36 | 0.1% | |
| 15 | 35 | 0.1% | |
| 39 | 30 | 0.1% | |
| 93 | 28 | < 0.1% | |
| 3 | 27 | < 0.1% | |
| 7 | 26 | < 0.1% | |
| 2 | 23 | < 0.1% | |
| 65 | 22 | < 0.1% | |
| 47 | 21 | < 0.1% | |
| 102 | 20 | < 0.1% | |
| 4 | 20 | < 0.1% | |
| 17 | 17 | < 0.1% | |
| 80 | 15 | < 0.1% | |
| 20 | 14 | < 0.1% | |
| 25 | 12 | < 0.1% | |
| 11 | 11 | < 0.1% | |
| 41 | 10 | < 0.1% | |
| 34 | 10 | < 0.1% | |
| 16 | 8 | < 0.1% | |
| Other values (40) | 86 | 0.1% |
| Value | Count | Frequency (%) | |
| 0 | 58643 | 98.7% | |
| 1 | 73 | 0.1% | |
| 2 | 23 | < 0.1% | |
| 3 | 27 | < 0.1% | |
| 4 | 20 | < 0.1% | |
| 5 | 46 | 0.1% | |
| 6 | 81 | 0.1% | |
| 7 | 26 | < 0.1% | |
| 8 | 46 | 0.1% | |
| 9 | 4 | < 0.1% |
| Value | Count | Frequency (%) | |
| 1776 | 1 | < 0.1% | |
| 1402 | 1 | < 0.1% | |
| 755 | 1 | < 0.1% | |
| 698 | 1 | < 0.1% | |
| 672 | 1 | < 0.1% | |
| 668 | 1 | < 0.1% | |
| 450 | 1 | < 0.1% | |
| 300 | 1 | < 0.1% | |
| 280 | 1 | < 0.1% | |
| 240 | 1 | < 0.1% |
basin
Categorical
| Distinct count | 9 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.4 MiB |
| Lake Victoria | |
|---|---|
| Pangani | |
| Rufiji | |
| Internal | |
| Lake Tanganyika | |
| Other values (4) |
| Value | Count | Frequency (%) | |
| Lake Victoria | 10248 | 17.3% | |
| Pangani | 8940 | 15.1% | |
| Rufiji | 7976 | 13.4% | |
| Internal | 7785 | 13.1% | |
| Lake Tanganyika | 6432 | 10.8% | |
| Wami / Ruvu | 5987 | 10.1% | |
| Lake Nyasa | 5085 | 8.6% | |
| Ruvuma / Southern Coast | 4493 | 7.6% | |
| Lake Rukwa | 2454 | 4.1% |
Length
| Max length | 23 |
|---|---|
| Median length | 10 |
| Mean length | 10.8923569 |
| Min length | 6 |
Most occurring characters
| Value | Count | Frequency (%) | |
| a | 107025 | 16.5% | |
| i | 57807 | 8.9% | |
| n | 50807 | 7.9% | |
| 49672 | 7.7% | ||
| e | 36497 | 5.6% | |
| u | 35883 | 5.5% | |
| k | 33105 | 5.1% | |
| t | 27019 | 4.2% | |
| L | 24219 | 3.7% | |
| r | 22526 | 3.5% | |
| R | 20910 | 3.2% | |
| o | 19234 | 3.0% | |
| g | 15372 | 2.4% | |
| y | 11517 | 1.8% | |
| v | 10480 | 1.6% | |
| m | 10480 | 1.6% | |
| / | 10480 | 1.6% | |
| V | 10248 | 1.6% | |
| c | 10248 | 1.6% | |
| s | 9578 | 1.5% | |
| P | 8940 | 1.4% | |
| f | 7976 | 1.2% | |
| j | 7976 | 1.2% | |
| I | 7785 | 1.2% | |
| l | 7785 | 1.2% | |
| Other values (7) | 33437 | 5.2% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 488262 | 75.5% | |
| Uppercase Letter | 98592 | 15.2% | |
| Space Separator | 49672 | 7.7% | |
| Other Punctuation | 10480 | 1.6% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| L | 24219 | 24.6% | |
| R | 20910 | 21.2% | |
| V | 10248 | 10.4% | |
| P | 8940 | 9.1% | |
| I | 7785 | 7.9% | |
| T | 6432 | 6.5% | |
| W | 5987 | 6.1% | |
| N | 5085 | 5.2% | |
| S | 4493 | 4.6% | |
| C | 4493 | 4.6% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| a | 107025 | 21.9% | |
| i | 57807 | 11.8% | |
| n | 50807 | 10.4% | |
| e | 36497 | 7.5% | |
| u | 35883 | 7.3% | |
| k | 33105 | 6.8% | |
| t | 27019 | 5.5% | |
| r | 22526 | 4.6% | |
| o | 19234 | 3.9% | |
| g | 15372 | 3.1% | |
| y | 11517 | 2.4% | |
| v | 10480 | 2.1% | |
| m | 10480 | 2.1% | |
| c | 10248 | 2.1% | |
| s | 9578 | 2.0% | |
| f | 7976 | 1.6% | |
| j | 7976 | 1.6% | |
| l | 7785 | 1.6% | |
| h | 4493 | 0.9% | |
| w | 2454 | 0.5% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 49672 | 100.0% |
Most frequent Other Punctuation characters
| Value | Count | Frequency (%) | |
| / | 10480 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 586854 | 90.7% | |
| Common | 60152 | 9.3% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| a | 107025 | 18.2% | |
| i | 57807 | 9.9% | |
| n | 50807 | 8.7% | |
| e | 36497 | 6.2% | |
| u | 35883 | 6.1% | |
| k | 33105 | 5.6% | |
| t | 27019 | 4.6% | |
| L | 24219 | 4.1% | |
| r | 22526 | 3.8% | |
| R | 20910 | 3.6% | |
| o | 19234 | 3.3% | |
| g | 15372 | 2.6% | |
| y | 11517 | 2.0% | |
| v | 10480 | 1.8% | |
| m | 10480 | 1.8% | |
| V | 10248 | 1.7% | |
| c | 10248 | 1.7% | |
| s | 9578 | 1.6% | |
| P | 8940 | 1.5% | |
| f | 7976 | 1.4% | |
| j | 7976 | 1.4% | |
| I | 7785 | 1.3% | |
| l | 7785 | 1.3% | |
| T | 6432 | 1.1% | |
| W | 5987 | 1.0% | |
| Other values (5) | 21018 | 3.6% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 49672 | 82.6% | ||
| / | 10480 | 17.4% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 647006 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| a | 107025 | 16.5% | |
| i | 57807 | 8.9% | |
| n | 50807 | 7.9% | |
| 49672 | 7.7% | ||
| e | 36497 | 5.6% | |
| u | 35883 | 5.5% | |
| k | 33105 | 5.1% | |
| t | 27019 | 4.2% | |
| L | 24219 | 3.7% | |
| r | 22526 | 3.5% | |
| R | 20910 | 3.2% | |
| o | 19234 | 3.0% | |
| g | 15372 | 2.4% | |
| y | 11517 | 1.8% | |
| v | 10480 | 1.6% | |
| m | 10480 | 1.6% | |
| / | 10480 | 1.6% | |
| V | 10248 | 1.6% | |
| c | 10248 | 1.6% | |
| s | 9578 | 1.5% | |
| P | 8940 | 1.4% | |
| f | 7976 | 1.2% | |
| j | 7976 | 1.2% | |
| I | 7785 | 1.2% | |
| l | 7785 | 1.2% | |
| Other values (7) | 33437 | 5.2% |
| Distinct count | 19287 |
|---|---|
| Unique (%) | 32.7% |
| Missing | 371 |
| Missing (%) | 0.6% |
| Memory size | 3.4 MiB |
| Madukani | 508 |
|---|---|
| Shuleni | 506 |
| Majengo | 502 |
| Kati | 373 |
| Mtakuja | 262 |
| Other values (19282) |
| Value | Count | Frequency (%) | |
| Madukani | 508 | 0.9% | |
| Shuleni | 506 | 0.9% | |
| Majengo | 502 | 0.8% | |
| Kati | 373 | 0.6% | |
| Mtakuja | 262 | 0.4% | |
| Sokoni | 232 | 0.4% | |
| M | 187 | 0.3% | |
| Muungano | 172 | 0.3% | |
| Mbuyuni | 164 | 0.3% | |
| Mlimani | 152 | 0.3% | |
| Songambele | 147 | 0.2% | |
| Miembeni | 134 | 0.2% | |
| Msikitini | 134 | 0.2% | |
| 1 | 132 | 0.2% | |
| Kibaoni | 114 | 0.2% | |
| Kanisani | 111 | 0.2% | |
| Mapinduzi | 109 | 0.2% | |
| I | 109 | 0.2% | |
| Mjini | 108 | 0.2% | |
| Mjimwema | 108 | 0.2% | |
| Mkwajuni | 104 | 0.2% | |
| Mwenge | 102 | 0.2% | |
| Mabatini | 98 | 0.2% | |
| Azimio | 98 | 0.2% | |
| Mission | 95 | 0.2% | |
| Other values (19262) | 54268 | 91.4% | |
| (Missing) | 371 | 0.6% |
Length
| Max length | 30 |
|---|---|
| Median length | 7 |
| Mean length | 7.867003367 |
| Min length | 1 |
Most occurring characters
| Value | Count | Frequency (%) | |
| a | 72374 | 15.5% | |
| i | 45666 | 9.8% | |
| n | 34241 | 7.3% | |
| u | 26424 | 5.7% | |
| e | 25671 | 5.5% | |
| o | 23556 | 5.0% | |
| M | 20431 | 4.4% | |
| g | 18951 | 4.1% | |
| l | 16372 | 3.5% | |
| m | 15053 | 3.2% | |
| K | 12545 | 2.7% | |
| b | 11843 | 2.5% | |
| 11766 | 2.5% | ||
| t | 11702 | 2.5% | |
| k | 11116 | 2.4% | |
| r | 10027 | 2.1% | |
| w | 10003 | 2.1% | |
| s | 9984 | 2.1% | |
| h | 9430 | 2.0% | |
| d | 8274 | 1.8% | |
| y | 7055 | 1.5% | |
| N | 6068 | 1.3% | |
| B | 5112 | 1.1% | |
| I | 4503 | 1.0% | |
| j | 4285 | 0.9% | |
| Other values (48) | 34848 | 7.5% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 382376 | 81.8% | |
| Uppercase Letter | 71291 | 15.3% | |
| Space Separator | 11766 | 2.5% | |
| Other Punctuation | 1184 | 0.3% | |
| Decimal Number | 589 | 0.1% | |
| Modifier Symbol | 45 | < 0.1% | |
| Dash Punctuation | 36 | < 0.1% | |
| Open Punctuation | 5 | < 0.1% | |
| Close Punctuation | 5 | < 0.1% | |
| Connector Punctuation | 3 | < 0.1% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| M | 20431 | 28.7% | |
| K | 12545 | 17.6% | |
| N | 6068 | 8.5% | |
| B | 5112 | 7.2% | |
| I | 4503 | 6.3% | |
| S | 4039 | 5.7% | |
| A | 3076 | 4.3% | |
| C | 2533 | 3.6% | |
| L | 2458 | 3.4% | |
| U | 1704 | 2.4% | |
| T | 1123 | 1.6% | |
| W | 1069 | 1.5% | |
| R | 905 | 1.3% | |
| O | 895 | 1.3% | |
| G | 894 | 1.3% | |
| J | 733 | 1.0% | |
| D | 629 | 0.9% | |
| P | 490 | 0.7% | |
| H | 489 | 0.7% | |
| E | 371 | 0.5% | |
| Z | 366 | 0.5% | |
| V | 333 | 0.5% | |
| Y | 283 | 0.4% | |
| F | 175 | 0.2% | |
| Q | 67 | 0.1% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| a | 72374 | 18.9% | |
| i | 45666 | 11.9% | |
| n | 34241 | 9.0% | |
| u | 26424 | 6.9% | |
| e | 25671 | 6.7% | |
| o | 23556 | 6.2% | |
| g | 18951 | 5.0% | |
| l | 16372 | 4.3% | |
| m | 15053 | 3.9% | |
| b | 11843 | 3.1% | |
| t | 11702 | 3.1% | |
| k | 11116 | 2.9% | |
| r | 10027 | 2.6% | |
| w | 10003 | 2.6% | |
| s | 9984 | 2.6% | |
| h | 9430 | 2.5% | |
| d | 8274 | 2.2% | |
| y | 7055 | 1.8% | |
| j | 4285 | 1.1% | |
| z | 3722 | 1.0% | |
| p | 2825 | 0.7% | |
| c | 1593 | 0.4% | |
| f | 1098 | 0.3% | |
| v | 1045 | 0.3% | |
| q | 62 | < 0.1% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 11766 | 100.0% |
Most frequent Other Punctuation characters
| Value | Count | Frequency (%) | |
| ' | 1017 | 85.9% | |
| / | 136 | 11.5% | |
| . | 29 | 2.4% | |
| # | 2 | 0.2% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 1 | 242 | 41.1% | |
| 2 | 70 | 11.9% | |
| 3 | 50 | 8.5% | |
| 4 | 49 | 8.3% | |
| 6 | 33 | 5.6% | |
| 8 | 32 | 5.4% | |
| 9 | 32 | 5.4% | |
| 0 | 30 | 5.1% | |
| 5 | 29 | 4.9% | |
| 7 | 22 | 3.7% |
Most frequent Modifier Symbol characters
| Value | Count | Frequency (%) | |
| ` | 45 | 100.0% |
Most frequent Open Punctuation characters
| Value | Count | Frequency (%) | |
| ( | 4 | 80.0% | |
| [ | 1 | 20.0% |
Most frequent Close Punctuation characters
| Value | Count | Frequency (%) | |
| ) | 4 | 80.0% | |
| ] | 1 | 20.0% |
Most frequent Dash Punctuation characters
| Value | Count | Frequency (%) | |
| - | 36 | 100.0% |
Most frequent Connector Punctuation characters
| Value | Count | Frequency (%) | |
| _ | 3 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 453667 | 97.1% | |
| Common | 13633 | 2.9% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| a | 72374 | 16.0% | |
| i | 45666 | 10.1% | |
| n | 34241 | 7.5% | |
| u | 26424 | 5.8% | |
| e | 25671 | 5.7% | |
| o | 23556 | 5.2% | |
| M | 20431 | 4.5% | |
| g | 18951 | 4.2% | |
| l | 16372 | 3.6% | |
| m | 15053 | 3.3% | |
| K | 12545 | 2.8% | |
| b | 11843 | 2.6% | |
| t | 11702 | 2.6% | |
| k | 11116 | 2.5% | |
| r | 10027 | 2.2% | |
| w | 10003 | 2.2% | |
| s | 9984 | 2.2% | |
| h | 9430 | 2.1% | |
| d | 8274 | 1.8% | |
| y | 7055 | 1.6% | |
| N | 6068 | 1.3% | |
| B | 5112 | 1.1% | |
| I | 4503 | 1.0% | |
| j | 4285 | 0.9% | |
| S | 4039 | 0.9% | |
| Other values (26) | 28942 | 6.4% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 11766 | 86.3% | ||
| ' | 1017 | 7.5% | |
| 1 | 242 | 1.8% | |
| / | 136 | 1.0% | |
| 2 | 70 | 0.5% | |
| 3 | 50 | 0.4% | |
| 4 | 49 | 0.4% | |
| ` | 45 | 0.3% | |
| - | 36 | 0.3% | |
| 6 | 33 | 0.2% | |
| 8 | 32 | 0.2% | |
| 9 | 32 | 0.2% | |
| 0 | 30 | 0.2% | |
| 5 | 29 | 0.2% | |
| . | 29 | 0.2% | |
| 7 | 22 | 0.2% | |
| ( | 4 | < 0.1% | |
| ) | 4 | < 0.1% | |
| _ | 3 | < 0.1% | |
| # | 2 | < 0.1% | |
| [ | 1 | < 0.1% | |
| ] | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 467300 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| a | 72374 | 15.5% | |
| i | 45666 | 9.8% | |
| n | 34241 | 7.3% | |
| u | 26424 | 5.7% | |
| e | 25671 | 5.5% | |
| o | 23556 | 5.0% | |
| M | 20431 | 4.4% | |
| g | 18951 | 4.1% | |
| l | 16372 | 3.5% | |
| m | 15053 | 3.2% | |
| K | 12545 | 2.7% | |
| b | 11843 | 2.5% | |
| 11766 | 2.5% | ||
| t | 11702 | 2.5% | |
| k | 11116 | 2.4% | |
| r | 10027 | 2.1% | |
| w | 10003 | 2.1% | |
| s | 9984 | 2.1% | |
| h | 9430 | 2.0% | |
| d | 8274 | 1.8% | |
| y | 7055 | 1.5% | |
| N | 6068 | 1.3% | |
| B | 5112 | 1.1% | |
| I | 4503 | 1.0% | |
| j | 4285 | 0.9% | |
| Other values (48) | 34848 | 7.5% |
region
Categorical
| Distinct count | 21 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.4 MiB |
| Iringa | 5294 |
|---|---|
| Shinyanga | 4982 |
| Mbeya | 4639 |
| Kilimanjaro | 4379 |
| Morogoro | 4006 |
| Other values (16) |
| Value | Count | Frequency (%) | |
| Iringa | 5294 | 8.9% | |
| Shinyanga | 4982 | 8.4% | |
| Mbeya | 4639 | 7.8% | |
| Kilimanjaro | 4379 | 7.4% | |
| Morogoro | 4006 | 6.7% | |
| Arusha | 3350 | 5.6% | |
| Kagera | 3316 | 5.6% | |
| Mwanza | 3102 | 5.2% | |
| Kigoma | 2816 | 4.7% | |
| Ruvuma | 2640 | 4.4% | |
| Pwani | 2635 | 4.4% | |
| Tanga | 2547 | 4.3% | |
| Dodoma | 2201 | 3.7% | |
| Singida | 2093 | 3.5% | |
| Mara | 1969 | 3.3% | |
| Tabora | 1959 | 3.3% | |
| Rukwa | 1808 | 3.0% | |
| Mtwara | 1730 | 2.9% | |
| Manyara | 1583 | 2.7% | |
| Lindi | 1546 | 2.6% | |
| Dar es Salaam | 805 | 1.4% |
Length
| Max length | 13 |
|---|---|
| Median length | 6 |
| Mean length | 6.623754209 |
| Min length | 4 |
Most occurring characters
| Value | Count | Frequency (%) | |
| a | 83413 | 21.2% | |
| n | 33143 | 8.4% | |
| r | 32397 | 8.2% | |
| i | 31763 | 8.1% | |
| o | 29580 | 7.5% | |
| g | 25054 | 6.4% | |
| M | 17029 | 4.3% | |
| m | 12841 | 3.3% | |
| y | 11204 | 2.8% | |
| K | 10511 | 2.7% | |
| u | 10438 | 2.7% | |
| w | 9275 | 2.4% | |
| e | 8760 | 2.2% | |
| h | 8332 | 2.1% | |
| S | 7880 | 2.0% | |
| b | 6598 | 1.7% | |
| d | 5840 | 1.5% | |
| I | 5294 | 1.3% | |
| l | 5184 | 1.3% | |
| T | 4506 | 1.1% | |
| R | 4448 | 1.1% | |
| j | 4379 | 1.1% | |
| s | 4155 | 1.1% | |
| A | 3350 | 0.9% | |
| z | 3102 | 0.8% | |
| Other values (7) | 14975 | 3.8% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 331636 | 84.3% | |
| Uppercase Letter | 60205 | 15.3% | |
| Space Separator | 1610 | 0.4% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| M | 17029 | 28.3% | |
| K | 10511 | 17.5% | |
| S | 7880 | 13.1% | |
| I | 5294 | 8.8% | |
| T | 4506 | 7.5% | |
| R | 4448 | 7.4% | |
| A | 3350 | 5.6% | |
| D | 3006 | 5.0% | |
| P | 2635 | 4.4% | |
| L | 1546 | 2.6% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| a | 83413 | 25.2% | |
| n | 33143 | 10.0% | |
| r | 32397 | 9.8% | |
| i | 31763 | 9.6% | |
| o | 29580 | 8.9% | |
| g | 25054 | 7.6% | |
| m | 12841 | 3.9% | |
| y | 11204 | 3.4% | |
| u | 10438 | 3.1% | |
| w | 9275 | 2.8% | |
| e | 8760 | 2.6% | |
| h | 8332 | 2.5% | |
| b | 6598 | 2.0% | |
| d | 5840 | 1.8% | |
| l | 5184 | 1.6% | |
| j | 4379 | 1.3% | |
| s | 4155 | 1.3% | |
| z | 3102 | 0.9% | |
| v | 2640 | 0.8% | |
| k | 1808 | 0.5% | |
| t | 1730 | 0.5% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 1610 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 391841 | 99.6% | |
| Common | 1610 | 0.4% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| a | 83413 | 21.3% | |
| n | 33143 | 8.5% | |
| r | 32397 | 8.3% | |
| i | 31763 | 8.1% | |
| o | 29580 | 7.5% | |
| g | 25054 | 6.4% | |
| M | 17029 | 4.3% | |
| m | 12841 | 3.3% | |
| y | 11204 | 2.9% | |
| K | 10511 | 2.7% | |
| u | 10438 | 2.7% | |
| w | 9275 | 2.4% | |
| e | 8760 | 2.2% | |
| h | 8332 | 2.1% | |
| S | 7880 | 2.0% | |
| b | 6598 | 1.7% | |
| d | 5840 | 1.5% | |
| I | 5294 | 1.4% | |
| l | 5184 | 1.3% | |
| T | 4506 | 1.1% | |
| R | 4448 | 1.1% | |
| j | 4379 | 1.1% | |
| s | 4155 | 1.1% | |
| A | 3350 | 0.9% | |
| z | 3102 | 0.8% | |
| Other values (6) | 13365 | 3.4% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 1610 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 393451 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| a | 83413 | 21.2% | |
| n | 33143 | 8.4% | |
| r | 32397 | 8.2% | |
| i | 31763 | 8.1% | |
| o | 29580 | 7.5% | |
| g | 25054 | 6.4% | |
| M | 17029 | 4.3% | |
| m | 12841 | 3.3% | |
| y | 11204 | 2.8% | |
| K | 10511 | 2.7% | |
| u | 10438 | 2.7% | |
| w | 9275 | 2.4% | |
| e | 8760 | 2.2% | |
| h | 8332 | 2.1% | |
| S | 7880 | 2.0% | |
| b | 6598 | 1.7% | |
| d | 5840 | 1.5% | |
| I | 5294 | 1.3% | |
| l | 5184 | 1.3% | |
| T | 4506 | 1.1% | |
| R | 4448 | 1.1% | |
| j | 4379 | 1.1% | |
| s | 4155 | 1.1% | |
| A | 3350 | 0.9% | |
| z | 3102 | 0.8% | |
| Other values (7) | 14975 | 3.8% |
region_code
Real number (ℝ≥0)
| Distinct count | 27 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15.297003367003366 |
|---|---|
| Minimum | 1 |
| Maximum | 99 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 3.4 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 5 |
| median | 12 |
| Q3 | 17 |
| 95-th percentile | 60 |
| Maximum | 99 |
| Range | 98 |
| Interquartile range (IQR) | 12 |
Descriptive statistics
| Standard deviation | 17.58740634 |
|---|---|
| Coefficient of variation (CV) | 1.149728866 |
| Kurtosis | 10.28843341 |
| Mean | 15.29700337 |
| Median Absolute Deviation (MAD) | 6 |
| Skewness | 3.17381811 |
| Sum | 908642 |
| Variance | 309.3168617 |
| Value | Count | Frequency (%) | |
| 11 | 5300 | 8.9% | |
| 17 | 5011 | 8.4% | |
| 12 | 4639 | 7.8% | |
| 3 | 4379 | 7.4% | |
| 5 | 4040 | 6.8% | |
| 18 | 3324 | 5.6% | |
| 19 | 3047 | 5.1% | |
| 2 | 3024 | 5.1% | |
| 16 | 2816 | 4.7% | |
| 10 | 2640 | 4.4% | |
| 4 | 2513 | 4.2% | |
| 1 | 2201 | 3.7% | |
| 13 | 2093 | 3.5% | |
| 14 | 1979 | 3.3% | |
| 20 | 1969 | 3.3% | |
| 15 | 1808 | 3.0% | |
| 6 | 1609 | 2.7% | |
| 21 | 1583 | 2.7% | |
| 80 | 1238 | 2.1% | |
| 60 | 1025 | 1.7% | |
| 90 | 917 | 1.5% | |
| 7 | 805 | 1.4% | |
| 99 | 423 | 0.7% | |
| 9 | 390 | 0.7% | |
| 24 | 326 | 0.5% | |
| Other values (2) | 301 | 0.5% |
| Value | Count | Frequency (%) | |
| 1 | 2201 | 3.7% | |
| 2 | 3024 | 5.1% | |
| 3 | 4379 | 7.4% | |
| 4 | 2513 | 4.2% | |
| 5 | 4040 | 6.8% | |
| 6 | 1609 | 2.7% | |
| 7 | 805 | 1.4% | |
| 8 | 300 | 0.5% | |
| 9 | 390 | 0.7% | |
| 10 | 2640 | 4.4% |
| Value | Count | Frequency (%) | |
| 99 | 423 | 0.7% | |
| 90 | 917 | 1.5% | |
| 80 | 1238 | 2.1% | |
| 60 | 1025 | 1.7% | |
| 40 | 1 | < 0.1% | |
| 24 | 326 | 0.5% | |
| 21 | 1583 | 2.7% | |
| 20 | 1969 | 3.3% | |
| 19 | 3047 | 5.1% | |
| 18 | 3324 | 5.6% |
district_code
Real number (ℝ≥0)
| Distinct count | 20 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.629747474747475 |
|---|---|
| Minimum | 0 |
| Maximum | 80 |
| Zeros | 23 |
| Zeros (%) | < 0.1% |
| Memory size | 3.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 3 |
| Q3 | 5 |
| 95-th percentile | 30 |
| Maximum | 80 |
| Range | 80 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 9.633648629 |
|---|---|
| Coefficient of variation (CV) | 1.711204396 |
| Kurtosis | 16.21428363 |
| Mean | 5.629747475 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 3.962045299 |
| Sum | 334407 |
| Variance | 92.80718592 |
| Value | Count | Frequency (%) | |
| 1 | 12203 | 20.5% | |
| 2 | 11173 | 18.8% | |
| 3 | 9998 | 16.8% | |
| 4 | 8999 | 15.1% | |
| 5 | 4356 | 7.3% | |
| 6 | 4074 | 6.9% | |
| 7 | 3343 | 5.6% | |
| 8 | 1043 | 1.8% | |
| 30 | 995 | 1.7% | |
| 33 | 874 | 1.5% | |
| 53 | 745 | 1.3% | |
| 43 | 505 | 0.9% | |
| 13 | 391 | 0.7% | |
| 23 | 293 | 0.5% | |
| 63 | 195 | 0.3% | |
| 62 | 109 | 0.2% | |
| 60 | 63 | 0.1% | |
| 0 | 23 | < 0.1% | |
| 80 | 12 | < 0.1% | |
| 67 | 6 | < 0.1% |
| Value | Count | Frequency (%) | |
| 0 | 23 | < 0.1% | |
| 1 | 12203 | 20.5% | |
| 2 | 11173 | 18.8% | |
| 3 | 9998 | 16.8% | |
| 4 | 8999 | 15.1% | |
| 5 | 4356 | 7.3% | |
| 6 | 4074 | 6.9% | |
| 7 | 3343 | 5.6% | |
| 8 | 1043 | 1.8% | |
| 13 | 391 | 0.7% |
| Value | Count | Frequency (%) | |
| 80 | 12 | < 0.1% | |
| 67 | 6 | < 0.1% | |
| 63 | 195 | 0.3% | |
| 62 | 109 | 0.2% | |
| 60 | 63 | 0.1% | |
| 53 | 745 | 1.3% | |
| 43 | 505 | 0.9% | |
| 33 | 874 | 1.5% | |
| 30 | 995 | 1.7% | |
| 23 | 293 | 0.5% |
| Distinct count | 125 |
|---|---|
| Unique (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.4 MiB |
| Njombe | 2503 |
|---|---|
| Arusha Rural | 1252 |
| Moshi Rural | 1251 |
| Bariadi | 1177 |
| Rungwe | 1106 |
| Other values (120) |
| Value | Count | Frequency (%) | |
| Njombe | 2503 | 4.2% | |
| Arusha Rural | 1252 | 2.1% | |
| Moshi Rural | 1251 | 2.1% | |
| Bariadi | 1177 | 2.0% | |
| Rungwe | 1106 | 1.9% | |
| Kilosa | 1094 | 1.8% | |
| Kasulu | 1047 | 1.8% | |
| Mbozi | 1034 | 1.7% | |
| Meru | 1009 | 1.7% | |
| Bagamoyo | 997 | 1.7% | |
| Singida Rural | 995 | 1.7% | |
| Kilombero | 959 | 1.6% | |
| Same | 877 | 1.5% | |
| Kibondo | 874 | 1.5% | |
| Kyela | 859 | 1.4% | |
| Kahama | 836 | 1.4% | |
| Magu | 824 | 1.4% | |
| Kigoma Rural | 824 | 1.4% | |
| Maswa | 809 | 1.4% | |
| Karagwe | 771 | 1.3% | |
| Mbinga | 750 | 1.3% | |
| Iringa Rural | 728 | 1.2% | |
| Serengeti | 716 | 1.2% | |
| Namtumbo | 694 | 1.2% | |
| Lushoto | 694 | 1.2% | |
| Other values (100) | 34720 | 58.5% |
Length
| Max length | 16 |
|---|---|
| Median length | 6 |
| Mean length | 7.416885522 |
| Min length | 3 |
Most occurring characters
| Value | Count | Frequency (%) | |
| a | 69982 | 15.9% | |
| o | 30079 | 6.8% | |
| i | 29483 | 6.7% | |
| u | 28324 | 6.4% | |
| r | 26886 | 6.1% | |
| e | 22579 | 5.1% | |
| n | 22521 | 5.1% | |
| l | 19238 | 4.4% | |
| g | 18385 | 4.2% | |
| M | 16017 | 3.6% | |
| m | 15622 | 3.5% | |
| b | 15603 | 3.5% | |
| R | 12207 | 2.8% | |
| K | 11663 | 2.6% | |
| 11235 | 2.6% | ||
| w | 9820 | 2.2% | |
| s | 9747 | 2.2% | |
| h | 8464 | 1.9% | |
| d | 8410 | 1.9% | |
| S | 6261 | 1.4% | |
| N | 5760 | 1.3% | |
| t | 5696 | 1.3% | |
| B | 4839 | 1.1% | |
| y | 4763 | 1.1% | |
| k | 3721 | 0.8% | |
| Other values (16) | 23258 | 5.3% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 358693 | 81.4% | |
| Uppercase Letter | 70635 | 16.0% | |
| Space Separator | 11235 | 2.6% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| M | 16017 | 22.7% | |
| R | 12207 | 17.3% | |
| K | 11663 | 16.5% | |
| S | 6261 | 8.9% | |
| N | 5760 | 8.2% | |
| B | 4839 | 6.9% | |
| U | 3410 | 4.8% | |
| I | 2480 | 3.5% | |
| L | 2131 | 3.0% | |
| T | 1367 | 1.9% | |
| A | 1315 | 1.9% | |
| H | 1153 | 1.6% | |
| C | 881 | 1.2% | |
| G | 488 | 0.7% | |
| D | 358 | 0.5% | |
| P | 305 | 0.4% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| a | 69982 | 19.5% | |
| o | 30079 | 8.4% | |
| i | 29483 | 8.2% | |
| u | 28324 | 7.9% | |
| r | 26886 | 7.5% | |
| e | 22579 | 6.3% | |
| n | 22521 | 6.3% | |
| l | 19238 | 5.4% | |
| g | 18385 | 5.1% | |
| m | 15622 | 4.4% | |
| b | 15603 | 4.3% | |
| w | 9820 | 2.7% | |
| s | 9747 | 2.7% | |
| h | 8464 | 2.4% | |
| d | 8410 | 2.3% | |
| t | 5696 | 1.6% | |
| y | 4763 | 1.3% | |
| k | 3721 | 1.0% | |
| j | 3496 | 1.0% | |
| z | 1943 | 0.5% | |
| p | 1854 | 0.5% | |
| f | 1106 | 0.3% | |
| v | 671 | 0.2% | |
| c | 300 | 0.1% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 11235 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 429328 | 97.4% | |
| Common | 11235 | 2.6% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| a | 69982 | 16.3% | |
| o | 30079 | 7.0% | |
| i | 29483 | 6.9% | |
| u | 28324 | 6.6% | |
| r | 26886 | 6.3% | |
| e | 22579 | 5.3% | |
| n | 22521 | 5.2% | |
| l | 19238 | 4.5% | |
| g | 18385 | 4.3% | |
| M | 16017 | 3.7% | |
| m | 15622 | 3.6% | |
| b | 15603 | 3.6% | |
| R | 12207 | 2.8% | |
| K | 11663 | 2.7% | |
| w | 9820 | 2.3% | |
| s | 9747 | 2.3% | |
| h | 8464 | 2.0% | |
| d | 8410 | 2.0% | |
| S | 6261 | 1.5% | |
| N | 5760 | 1.3% | |
| t | 5696 | 1.3% | |
| B | 4839 | 1.1% | |
| y | 4763 | 1.1% | |
| k | 3721 | 0.9% | |
| j | 3496 | 0.8% | |
| Other values (15) | 19762 | 4.6% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 11235 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 440563 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| a | 69982 | 15.9% | |
| o | 30079 | 6.8% | |
| i | 29483 | 6.7% | |
| u | 28324 | 6.4% | |
| r | 26886 | 6.1% | |
| e | 22579 | 5.1% | |
| n | 22521 | 5.1% | |
| l | 19238 | 4.4% | |
| g | 18385 | 4.2% | |
| M | 16017 | 3.6% | |
| m | 15622 | 3.5% | |
| b | 15603 | 3.5% | |
| R | 12207 | 2.8% | |
| K | 11663 | 2.6% | |
| 11235 | 2.6% | ||
| w | 9820 | 2.2% | |
| s | 9747 | 2.2% | |
| h | 8464 | 1.9% | |
| d | 8410 | 1.9% | |
| S | 6261 | 1.4% | |
| N | 5760 | 1.3% | |
| t | 5696 | 1.3% | |
| B | 4839 | 1.1% | |
| y | 4763 | 1.1% | |
| k | 3721 | 0.8% | |
| Other values (16) | 23258 | 5.3% |
| Distinct count | 2092 |
|---|---|
| Unique (%) | 3.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.4 MiB |
| Igosi | 307 |
|---|---|
| Imalinyi | 252 |
| Siha Kati | 232 |
| Mdandu | 231 |
| Nduruma | 217 |
| Other values (2087) |
| Value | Count | Frequency (%) | |
| Igosi | 307 | 0.5% | |
| Imalinyi | 252 | 0.4% | |
| Siha Kati | 232 | 0.4% | |
| Mdandu | 231 | 0.4% | |
| Nduruma | 217 | 0.4% | |
| Mishamo | 203 | 0.3% | |
| Kitunda | 203 | 0.3% | |
| Msindo | 201 | 0.3% | |
| Chalinze | 196 | 0.3% | |
| Maji ya Chai | 190 | 0.3% | |
| Usuka | 187 | 0.3% | |
| Ngarenanyuki | 172 | 0.3% | |
| Chanika | 171 | 0.3% | |
| Vikindu | 162 | 0.3% | |
| Mtwango | 153 | 0.3% | |
| Matola | 145 | 0.2% | |
| Zinga/Ikerege | 141 | 0.2% | |
| Maramba | 139 | 0.2% | |
| Wanging'ombe | 139 | 0.2% | |
| Itete | 137 | 0.2% | |
| Magomeni | 135 | 0.2% | |
| Kikatiti | 134 | 0.2% | |
| Ifakara | 134 | 0.2% | |
| Olkokola | 133 | 0.2% | |
| Maposeni | 130 | 0.2% | |
| Other values (2067) | 54956 | 92.5% |
Length
| Max length | 23 |
|---|---|
| Median length | 7 |
| Mean length | 7.505841751 |
| Min length | 3 |
Most occurring characters
| Value | Count | Frequency (%) | |
| a | 69533 | 15.6% | |
| i | 40243 | 9.0% | |
| n | 29584 | 6.6% | |
| u | 27015 | 6.1% | |
| o | 26093 | 5.9% | |
| e | 23589 | 5.3% | |
| g | 21166 | 4.7% | |
| M | 18916 | 4.2% | |
| m | 16216 | 3.6% | |
| l | 15799 | 3.5% | |
| r | 13057 | 2.9% | |
| b | 12816 | 2.9% | |
| s | 11335 | 2.5% | |
| K | 11212 | 2.5% | |
| h | 10975 | 2.5% | |
| k | 10812 | 2.4% | |
| t | 9311 | 2.1% | |
| w | 9137 | 2.0% | |
| d | 8960 | 2.0% | |
| y | 7186 | 1.6% | |
| I | 6094 | 1.4% | |
| N | 5919 | 1.3% | |
| 5408 | 1.2% | ||
| z | 3577 | 0.8% | |
| S | 3354 | 0.8% | |
| Other values (29) | 28540 | 6.4% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 374730 | 84.0% | |
| Uppercase Letter | 64523 | 14.5% | |
| Space Separator | 5408 | 1.2% | |
| Other Punctuation | 1163 | 0.3% | |
| Dash Punctuation | 23 | < 0.1% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| M | 18916 | 29.3% | |
| K | 11212 | 17.4% | |
| I | 6094 | 9.4% | |
| N | 5919 | 9.2% | |
| S | 3354 | 5.2% | |
| L | 3162 | 4.9% | |
| B | 3098 | 4.8% | |
| U | 2913 | 4.5% | |
| C | 2123 | 3.3% | |
| R | 1692 | 2.6% | |
| T | 776 | 1.2% | |
| D | 743 | 1.2% | |
| O | 661 | 1.0% | |
| V | 634 | 1.0% | |
| P | 577 | 0.9% | |
| H | 551 | 0.9% | |
| W | 387 | 0.6% | |
| G | 369 | 0.6% | |
| Z | 339 | 0.5% | |
| E | 289 | 0.4% | |
| A | 260 | 0.4% | |
| J | 187 | 0.3% | |
| Y | 149 | 0.2% | |
| Q | 76 | 0.1% | |
| F | 42 | 0.1% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| a | 69533 | 18.6% | |
| i | 40243 | 10.7% | |
| n | 29584 | 7.9% | |
| u | 27015 | 7.2% | |
| o | 26093 | 7.0% | |
| e | 23589 | 6.3% | |
| g | 21166 | 5.6% | |
| m | 16216 | 4.3% | |
| l | 15799 | 4.2% | |
| r | 13057 | 3.5% | |
| b | 12816 | 3.4% | |
| s | 11335 | 3.0% | |
| h | 10975 | 2.9% | |
| k | 10812 | 2.9% | |
| t | 9311 | 2.5% | |
| w | 9137 | 2.4% | |
| d | 8960 | 2.4% | |
| y | 7186 | 1.9% | |
| z | 3577 | 1.0% | |
| p | 2895 | 0.8% | |
| j | 2446 | 0.7% | |
| c | 1376 | 0.4% | |
| f | 816 | 0.2% | |
| v | 777 | 0.2% | |
| q | 16 | < 0.1% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 5408 | 100.0% |
Most frequent Other Punctuation characters
| Value | Count | Frequency (%) | |
| ' | 1013 | 87.1% | |
| / | 150 | 12.9% |
Most frequent Dash Punctuation characters
| Value | Count | Frequency (%) | |
| - | 23 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 439253 | 98.5% | |
| Common | 6594 | 1.5% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| a | 69533 | 15.8% | |
| i | 40243 | 9.2% | |
| n | 29584 | 6.7% | |
| u | 27015 | 6.2% | |
| o | 26093 | 5.9% | |
| e | 23589 | 5.4% | |
| g | 21166 | 4.8% | |
| M | 18916 | 4.3% | |
| m | 16216 | 3.7% | |
| l | 15799 | 3.6% | |
| r | 13057 | 3.0% | |
| b | 12816 | 2.9% | |
| s | 11335 | 2.6% | |
| K | 11212 | 2.6% | |
| h | 10975 | 2.5% | |
| k | 10812 | 2.5% | |
| t | 9311 | 2.1% | |
| w | 9137 | 2.1% | |
| d | 8960 | 2.0% | |
| y | 7186 | 1.6% | |
| I | 6094 | 1.4% | |
| N | 5919 | 1.3% | |
| z | 3577 | 0.8% | |
| S | 3354 | 0.8% | |
| L | 3162 | 0.7% | |
| Other values (25) | 24192 | 5.5% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 5408 | 82.0% | ||
| ' | 1013 | 15.4% | |
| / | 150 | 2.3% | |
| - | 23 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 445847 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| a | 69533 | 15.6% | |
| i | 40243 | 9.0% | |
| n | 29584 | 6.6% | |
| u | 27015 | 6.1% | |
| o | 26093 | 5.9% | |
| e | 23589 | 5.3% | |
| g | 21166 | 4.7% | |
| M | 18916 | 4.2% | |
| m | 16216 | 3.6% | |
| l | 15799 | 3.5% | |
| r | 13057 | 2.9% | |
| b | 12816 | 2.9% | |
| s | 11335 | 2.5% | |
| K | 11212 | 2.5% | |
| h | 10975 | 2.5% | |
| k | 10812 | 2.4% | |
| t | 9311 | 2.1% | |
| w | 9137 | 2.0% | |
| d | 8960 | 2.0% | |
| y | 7186 | 1.6% | |
| I | 6094 | 1.4% | |
| N | 5919 | 1.3% | |
| 5408 | 1.2% | ||
| z | 3577 | 0.8% | |
| S | 3354 | 0.8% | |
| Other values (29) | 28540 | 6.4% |
| Distinct count | 1049 |
|---|---|
| Unique (%) | 1.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 179.90998316498317 |
|---|---|
| Minimum | 0 |
| Maximum | 30500 |
| Zeros | 21381 |
| Zeros (%) | 36.0% |
| Memory size | 3.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 25 |
| Q3 | 215 |
| 95-th percentile | 680 |
| Maximum | 30500 |
| Range | 30500 |
| Interquartile range (IQR) | 215 |
Descriptive statistics
| Standard deviation | 471.4821757 |
|---|---|
| Coefficient of variation (CV) | 2.620655994 |
| Kurtosis | 402.2801153 |
| Mean | 179.9099832 |
| Median Absolute Deviation (MAD) | 25 |
| Skewness | 12.66071359 |
| Sum | 10686653 |
| Variance | 222295.442 |
| Value | Count | Frequency (%) | |
| 0 | 21381 | 36.0% | |
| 1 | 7025 | 11.8% | |
| 200 | 1940 | 3.3% | |
| 150 | 1892 | 3.2% | |
| 250 | 1681 | 2.8% | |
| 300 | 1476 | 2.5% | |
| 100 | 1146 | 1.9% | |
| 50 | 1139 | 1.9% | |
| 500 | 1009 | 1.7% | |
| 350 | 986 | 1.7% | |
| 120 | 916 | 1.5% | |
| 400 | 775 | 1.3% | |
| 60 | 706 | 1.2% | |
| 30 | 626 | 1.1% | |
| 40 | 552 | 0.9% | |
| 80 | 533 | 0.9% | |
| 450 | 499 | 0.8% | |
| 20 | 462 | 0.8% | |
| 600 | 438 | 0.7% | |
| 230 | 388 | 0.7% | |
| 75 | 289 | 0.5% | |
| 1000 | 278 | 0.5% | |
| 800 | 269 | 0.5% | |
| 90 | 265 | 0.4% | |
| 130 | 264 | 0.4% | |
| Other values (1024) | 12465 | 21.0% |
| Value | Count | Frequency (%) | |
| 0 | 21381 | 36.0% | |
| 1 | 7025 | 11.8% | |
| 2 | 4 | < 0.1% | |
| 3 | 4 | < 0.1% | |
| 4 | 13 | < 0.1% | |
| 5 | 44 | 0.1% | |
| 6 | 19 | < 0.1% | |
| 7 | 3 | < 0.1% | |
| 8 | 23 | < 0.1% | |
| 9 | 11 | < 0.1% |
| Value | Count | Frequency (%) | |
| 30500 | 1 | < 0.1% | |
| 15300 | 1 | < 0.1% | |
| 11463 | 1 | < 0.1% | |
| 10000 | 3 | < 0.1% | |
| 9865 | 1 | < 0.1% | |
| 9500 | 1 | < 0.1% | |
| 9000 | 3 | < 0.1% | |
| 8848 | 1 | < 0.1% | |
| 8600 | 1 | < 0.1% | |
| 8500 | 1 | < 0.1% |
| Distinct count | 2 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 3334 |
| Missing (%) | 5.6% |
| Memory size | 3.4 MiB |
| True | |
|---|---|
| False | 5055 |
| (Missing) | 3334 |
| Value | Count | Frequency (%) | |
| True | 51011 | 85.9% | |
| False | 5055 | 8.5% | |
| (Missing) | 3334 | 5.6% |
| Distinct count | 1 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.4 MiB |
| GeoData Consultants Ltd |
|---|
| Value | Count | Frequency (%) | |
| GeoData Consultants Ltd | 59400 | 100.0% |
Length
| Max length | 23 |
|---|---|
| Median length | 23 |
| Mean length | 23 |
| Min length | 23 |
Most occurring characters
| Value | Count | Frequency (%) | |
| t | 237600 | 17.4% | |
| a | 178200 | 13.0% | |
| o | 118800 | 8.7% | |
| 118800 | 8.7% | ||
| n | 118800 | 8.7% | |
| s | 118800 | 8.7% | |
| G | 59400 | 4.3% | |
| e | 59400 | 4.3% | |
| D | 59400 | 4.3% | |
| C | 59400 | 4.3% | |
| u | 59400 | 4.3% | |
| l | 59400 | 4.3% | |
| L | 59400 | 4.3% | |
| d | 59400 | 4.3% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 1009800 | 73.9% | |
| Uppercase Letter | 237600 | 17.4% | |
| Space Separator | 118800 | 8.7% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| G | 59400 | 25.0% | |
| D | 59400 | 25.0% | |
| C | 59400 | 25.0% | |
| L | 59400 | 25.0% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| t | 237600 | 23.5% | |
| a | 178200 | 17.6% | |
| o | 118800 | 11.8% | |
| n | 118800 | 11.8% | |
| s | 118800 | 11.8% | |
| e | 59400 | 5.9% | |
| u | 59400 | 5.9% | |
| l | 59400 | 5.9% | |
| d | 59400 | 5.9% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 118800 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 1247400 | 91.3% | |
| Common | 118800 | 8.7% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| t | 237600 | 19.0% | |
| a | 178200 | 14.3% | |
| o | 118800 | 9.5% | |
| n | 118800 | 9.5% | |
| s | 118800 | 9.5% | |
| G | 59400 | 4.8% | |
| e | 59400 | 4.8% | |
| D | 59400 | 4.8% | |
| C | 59400 | 4.8% | |
| u | 59400 | 4.8% | |
| l | 59400 | 4.8% | |
| L | 59400 | 4.8% | |
| d | 59400 | 4.8% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 118800 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 1366200 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| t | 237600 | 17.4% | |
| a | 178200 | 13.0% | |
| o | 118800 | 8.7% | |
| 118800 | 8.7% | ||
| n | 118800 | 8.7% | |
| s | 118800 | 8.7% | |
| G | 59400 | 4.3% | |
| e | 59400 | 4.3% | |
| D | 59400 | 4.3% | |
| C | 59400 | 4.3% | |
| u | 59400 | 4.3% | |
| l | 59400 | 4.3% | |
| L | 59400 | 4.3% | |
| d | 59400 | 4.3% |
| Distinct count | 12 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 3877 |
| Missing (%) | 6.5% |
| Memory size | 3.4 MiB |
| VWC | |
|---|---|
| WUG | 5206 |
| Water authority | 3153 |
| WUA | 2883 |
| Water Board | 2748 |
| Other values (7) | 4740 |
| Value | Count | Frequency (%) | |
| VWC | 36793 | 61.9% | |
| WUG | 5206 | 8.8% | |
| Water authority | 3153 | 5.3% | |
| WUA | 2883 | 4.9% | |
| Water Board | 2748 | 4.6% | |
| Parastatal | 1680 | 2.8% | |
| Private operator | 1063 | 1.8% | |
| Company | 1061 | 1.8% | |
| Other | 766 | 1.3% | |
| SWC | 97 | 0.2% | |
| Trust | 72 | 0.1% | |
| None | 1 | < 0.1% | |
| (Missing) | 3877 | 6.5% |
Length
| Max length | 16 |
|---|---|
| Median length | 3 |
| Mean length | 4.537373737 |
| Min length | 3 |
Most occurring characters
| Value | Count | Frequency (%) | |
| W | 50880 | 18.9% | |
| C | 37951 | 14.1% | |
| V | 36793 | 13.7% | |
| a | 25586 | 9.5% | |
| t | 18531 | 6.9% | |
| r | 17509 | 6.5% | |
| o | 9089 | 3.4% | |
| n | 8816 | 3.3% | |
| e | 8794 | 3.3% | |
| U | 8089 | 3.0% | |
| 6964 | 2.6% | ||
| G | 5206 | 1.9% | |
| i | 4216 | 1.6% | |
| y | 4214 | 1.6% | |
| h | 3919 | 1.5% | |
| u | 3225 | 1.2% | |
| A | 2883 | 1.1% | |
| B | 2748 | 1.0% | |
| d | 2748 | 1.0% | |
| P | 2743 | 1.0% | |
| p | 2124 | 0.8% | |
| s | 1752 | 0.7% | |
| l | 1680 | 0.6% | |
| v | 1063 | 0.4% | |
| m | 1061 | 0.4% | |
| Other values (4) | 936 | 0.3% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Uppercase Letter | 148229 | 55.0% | |
| Lowercase Letter | 114327 | 42.4% | |
| Space Separator | 6964 | 2.6% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| W | 50880 | 34.3% | |
| C | 37951 | 25.6% | |
| V | 36793 | 24.8% | |
| U | 8089 | 5.5% | |
| G | 5206 | 3.5% | |
| A | 2883 | 1.9% | |
| B | 2748 | 1.9% | |
| P | 2743 | 1.9% | |
| O | 766 | 0.5% | |
| S | 97 | 0.1% | |
| T | 72 | < 0.1% | |
| N | 1 | < 0.1% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| a | 25586 | 22.4% | |
| t | 18531 | 16.2% | |
| r | 17509 | 15.3% | |
| o | 9089 | 8.0% | |
| n | 8816 | 7.7% | |
| e | 8794 | 7.7% | |
| i | 4216 | 3.7% | |
| y | 4214 | 3.7% | |
| h | 3919 | 3.4% | |
| u | 3225 | 2.8% | |
| d | 2748 | 2.4% | |
| p | 2124 | 1.9% | |
| s | 1752 | 1.5% | |
| l | 1680 | 1.5% | |
| v | 1063 | 0.9% | |
| m | 1061 | 0.9% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 6964 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 262556 | 97.4% | |
| Common | 6964 | 2.6% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| W | 50880 | 19.4% | |
| C | 37951 | 14.5% | |
| V | 36793 | 14.0% | |
| a | 25586 | 9.7% | |
| t | 18531 | 7.1% | |
| r | 17509 | 6.7% | |
| o | 9089 | 3.5% | |
| n | 8816 | 3.4% | |
| e | 8794 | 3.3% | |
| U | 8089 | 3.1% | |
| G | 5206 | 2.0% | |
| i | 4216 | 1.6% | |
| y | 4214 | 1.6% | |
| h | 3919 | 1.5% | |
| u | 3225 | 1.2% | |
| A | 2883 | 1.1% | |
| B | 2748 | 1.0% | |
| d | 2748 | 1.0% | |
| P | 2743 | 1.0% | |
| p | 2124 | 0.8% | |
| s | 1752 | 0.7% | |
| l | 1680 | 0.6% | |
| v | 1063 | 0.4% | |
| m | 1061 | 0.4% | |
| O | 766 | 0.3% | |
| Other values (3) | 170 | 0.1% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 6964 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 269520 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| W | 50880 | 18.9% | |
| C | 37951 | 14.1% | |
| V | 36793 | 13.7% | |
| a | 25586 | 9.5% | |
| t | 18531 | 6.9% | |
| r | 17509 | 6.5% | |
| o | 9089 | 3.4% | |
| n | 8816 | 3.3% | |
| e | 8794 | 3.3% | |
| U | 8089 | 3.0% | |
| 6964 | 2.6% | ||
| G | 5206 | 1.9% | |
| i | 4216 | 1.6% | |
| y | 4214 | 1.6% | |
| h | 3919 | 1.5% | |
| u | 3225 | 1.2% | |
| A | 2883 | 1.1% | |
| B | 2748 | 1.0% | |
| d | 2748 | 1.0% | |
| P | 2743 | 1.0% | |
| p | 2124 | 0.8% | |
| s | 1752 | 0.7% | |
| l | 1680 | 0.6% | |
| v | 1063 | 0.4% | |
| m | 1061 | 0.4% | |
| Other values (4) | 936 | 0.3% |
| Distinct count | 2696 |
|---|---|
| Unique (%) | 8.6% |
| Missing | 28166 |
| Missing (%) | 47.4% |
| Memory size | 3.4 MiB |
| K | 682 |
|---|---|
| None | 644 |
| Borehole | 546 |
| Chalinze wate | 405 |
| M | 400 |
| Other values (2691) |
| Value | Count | Frequency (%) | |
| K | 682 | 1.1% | |
| None | 644 | 1.1% | |
| Borehole | 546 | 0.9% | |
| Chalinze wate | 405 | 0.7% | |
| M | 400 | 0.7% | |
| DANIDA | 379 | 0.6% | |
| Government | 320 | 0.5% | |
| Ngana water supplied scheme | 270 | 0.5% | |
| wanging'ombe water supply s | 261 | 0.4% | |
| wanging'ombe supply scheme | 234 | 0.4% | |
| I | 229 | 0.4% | |
| Bagamoyo wate | 229 | 0.4% | |
| Uroki-Bomang'ombe water sup | 209 | 0.4% | |
| N | 204 | 0.3% | |
| Kirua kahe gravity water supply trust | 193 | 0.3% | |
| Machumba estate pipe line | 185 | 0.3% | |
| Makwale water supplied sche | 166 | 0.3% | |
| Kijiji | 161 | 0.3% | |
| S | 154 | 0.3% | |
| mtwango water supply scheme | 152 | 0.3% | |
| Handeni Trunk Main(H | 152 | 0.3% | |
| Losaa-Kia water supply | 152 | 0.3% | |
| Mkongoro Two | 147 | 0.2% | |
| Roman | 139 | 0.2% | |
| Mkongoro One | 128 | 0.2% | |
| Other values (2671) | 24493 | 41.2% | |
| (Missing) | 28166 | 47.4% |
Length
| Max length | 46 |
|---|---|
| Median length | 3 |
| Mean length | 8.94456229 |
| Min length | 1 |
Most occurring characters
| Value | Count | Frequency (%) | |
| a | 76750 | 14.4% | |
| n | 74092 | 13.9% | |
| 41252 | 7.8% | ||
| e | 35239 | 6.6% | |
| i | 26411 | 5.0% | |
| p | 22451 | 4.2% | |
| r | 21816 | 4.1% | |
| t | 19216 | 3.6% | |
| u | 18441 | 3.5% | |
| o | 17418 | 3.3% | |
| l | 17308 | 3.3% | |
| s | 16430 | 3.1% | |
| w | 16361 | 3.1% | |
| m | 14147 | 2.7% | |
| y | 12156 | 2.3% | |
| g | 11340 | 2.1% | |
| M | 9314 | 1.8% | |
| h | 8046 | 1.5% | |
| K | 5600 | 1.1% | |
| d | 5538 | 1.0% | |
| k | 5388 | 1.0% | |
| b | 5135 | 1.0% | |
| c | 4978 | 0.9% | |
| N | 4439 | 0.8% | |
| S | 3770 | 0.7% | |
| Other values (43) | 38271 | 7.2% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 437681 | 82.4% | |
| Uppercase Letter | 50064 | 9.4% | |
| Space Separator | 41252 | 7.8% | |
| Other Punctuation | 1317 | 0.2% | |
| Dash Punctuation | 554 | 0.1% | |
| Open Punctuation | 191 | < 0.1% | |
| Decimal Number | 147 | < 0.1% | |
| Modifier Symbol | 70 | < 0.1% | |
| Close Punctuation | 31 | < 0.1% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| M | 9314 | 18.6% | |
| K | 5600 | 11.2% | |
| N | 4439 | 8.9% | |
| S | 3770 | 7.5% | |
| A | 2729 | 5.5% | |
| I | 2691 | 5.4% | |
| W | 2531 | 5.1% | |
| B | 2387 | 4.8% | |
| L | 2107 | 4.2% | |
| U | 1790 | 3.6% | |
| D | 1576 | 3.1% | |
| T | 1550 | 3.1% | |
| C | 1527 | 3.1% | |
| R | 1407 | 2.8% | |
| E | 1336 | 2.7% | |
| P | 1047 | 2.1% | |
| H | 1023 | 2.0% | |
| O | 955 | 1.9% | |
| G | 899 | 1.8% | |
| J | 385 | 0.8% | |
| V | 369 | 0.7% | |
| Y | 268 | 0.5% | |
| F | 224 | 0.4% | |
| Z | 98 | 0.2% | |
| Q | 42 | 0.1% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| a | 76750 | 17.5% | |
| n | 74092 | 16.9% | |
| e | 35239 | 8.1% | |
| i | 26411 | 6.0% | |
| p | 22451 | 5.1% | |
| r | 21816 | 5.0% | |
| t | 19216 | 4.4% | |
| u | 18441 | 4.2% | |
| o | 17418 | 4.0% | |
| l | 17308 | 4.0% | |
| s | 16430 | 3.8% | |
| w | 16361 | 3.7% | |
| m | 14147 | 3.2% | |
| y | 12156 | 2.8% | |
| g | 11340 | 2.6% | |
| h | 8046 | 1.8% | |
| d | 5538 | 1.3% | |
| k | 5388 | 1.2% | |
| b | 5135 | 1.2% | |
| c | 4978 | 1.1% | |
| v | 3255 | 0.7% | |
| j | 3062 | 0.7% | |
| z | 1708 | 0.4% | |
| f | 955 | 0.2% | |
| q | 36 | < 0.1% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 41252 | 100.0% |
Most frequent Other Punctuation characters
| Value | Count | Frequency (%) | |
| ' | 938 | 71.2% | |
| / | 370 | 28.1% | |
| & | 8 | 0.6% | |
| : | 1 | 0.1% |
Most frequent Dash Punctuation characters
| Value | Count | Frequency (%) | |
| - | 554 | 100.0% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 2 | 61 | 41.5% | |
| 3 | 55 | 37.4% | |
| 7 | 7 | 4.8% | |
| 1 | 7 | 4.8% | |
| 4 | 7 | 4.8% | |
| 5 | 4 | 2.7% | |
| 0 | 3 | 2.0% | |
| 6 | 3 | 2.0% |
Most frequent Open Punctuation characters
| Value | Count | Frequency (%) | |
| ( | 191 | 100.0% |
Most frequent Close Punctuation characters
| Value | Count | Frequency (%) | |
| ) | 31 | 100.0% |
Most frequent Modifier Symbol characters
| Value | Count | Frequency (%) | |
| ` | 70 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 487745 | 91.8% | |
| Common | 43562 | 8.2% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| a | 76750 | 15.7% | |
| n | 74092 | 15.2% | |
| e | 35239 | 7.2% | |
| i | 26411 | 5.4% | |
| p | 22451 | 4.6% | |
| r | 21816 | 4.5% | |
| t | 19216 | 3.9% | |
| u | 18441 | 3.8% | |
| o | 17418 | 3.6% | |
| l | 17308 | 3.5% | |
| s | 16430 | 3.4% | |
| w | 16361 | 3.4% | |
| m | 14147 | 2.9% | |
| y | 12156 | 2.5% | |
| g | 11340 | 2.3% | |
| M | 9314 | 1.9% | |
| h | 8046 | 1.6% | |
| K | 5600 | 1.1% | |
| d | 5538 | 1.1% | |
| k | 5388 | 1.1% | |
| b | 5135 | 1.1% | |
| c | 4978 | 1.0% | |
| N | 4439 | 0.9% | |
| S | 3770 | 0.8% | |
| v | 3255 | 0.7% | |
| Other values (26) | 32706 | 6.7% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 41252 | 94.7% | ||
| ' | 938 | 2.2% | |
| - | 554 | 1.3% | |
| / | 370 | 0.8% | |
| ( | 191 | 0.4% | |
| ` | 70 | 0.2% | |
| 2 | 61 | 0.1% | |
| 3 | 55 | 0.1% | |
| ) | 31 | 0.1% | |
| & | 8 | < 0.1% | |
| 7 | 7 | < 0.1% | |
| 1 | 7 | < 0.1% | |
| 4 | 7 | < 0.1% | |
| 5 | 4 | < 0.1% | |
| 0 | 3 | < 0.1% | |
| 6 | 3 | < 0.1% | |
| : | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 531307 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| a | 76750 | 14.4% | |
| n | 74092 | 13.9% | |
| 41252 | 7.8% | ||
| e | 35239 | 6.6% | |
| i | 26411 | 5.0% | |
| p | 22451 | 4.2% | |
| r | 21816 | 4.1% | |
| t | 19216 | 3.6% | |
| u | 18441 | 3.5% | |
| o | 17418 | 3.3% | |
| l | 17308 | 3.3% | |
| s | 16430 | 3.1% | |
| w | 16361 | 3.1% | |
| m | 14147 | 2.7% | |
| y | 12156 | 2.3% | |
| g | 11340 | 2.1% | |
| M | 9314 | 1.8% | |
| h | 8046 | 1.5% | |
| K | 5600 | 1.1% | |
| d | 5538 | 1.0% | |
| k | 5388 | 1.0% | |
| b | 5135 | 1.0% | |
| c | 4978 | 0.9% | |
| N | 4439 | 0.8% | |
| S | 3770 | 0.7% | |
| Other values (43) | 38271 | 7.2% |
| Distinct count | 2 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 3056 |
| Missing (%) | 5.1% |
| Memory size | 3.4 MiB |
| True | |
|---|---|
| False | |
| (Missing) | 3056 |
| Value | Count | Frequency (%) | |
| True | 38852 | 65.4% | |
| False | 17492 | 29.4% | |
| (Missing) | 3056 | 5.1% |
| Distinct count | 55 |
|---|---|
| Unique (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1300.6524747474748 |
|---|---|
| Minimum | 0 |
| Maximum | 2013 |
| Zeros | 20709 |
| Zeros (%) | 34.9% |
| Memory size | 3.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1986 |
| Q3 | 2004 |
| 95-th percentile | 2010 |
| Maximum | 2013 |
| Range | 2013 |
| Interquartile range (IQR) | 2004 |
Descriptive statistics
| Standard deviation | 951.6205473 |
|---|---|
| Coefficient of variation (CV) | 0.7316485885 |
| Kurtosis | -1.596432369 |
| Mean | 1300.652475 |
| Median Absolute Deviation (MAD) | 22 |
| Skewness | -0.6349277866 |
| Sum | 77258757 |
| Variance | 905581.6661 |
| Value | Count | Frequency (%) | |
| 0 | 20709 | 34.9% | |
| 2010 | 2645 | 4.5% | |
| 2008 | 2613 | 4.4% | |
| 2009 | 2533 | 4.3% | |
| 2000 | 2091 | 3.5% | |
| 2007 | 1587 | 2.7% | |
| 2006 | 1471 | 2.5% | |
| 2003 | 1286 | 2.2% | |
| 2011 | 1256 | 2.1% | |
| 2004 | 1123 | 1.9% | |
| 2012 | 1084 | 1.8% | |
| 2002 | 1075 | 1.8% | |
| 1978 | 1037 | 1.7% | |
| 1995 | 1014 | 1.7% | |
| 2005 | 1011 | 1.7% | |
| 1999 | 979 | 1.6% | |
| 1998 | 966 | 1.6% | |
| 1990 | 954 | 1.6% | |
| 1985 | 945 | 1.6% | |
| 1980 | 811 | 1.4% | |
| 1996 | 811 | 1.4% | |
| 1984 | 779 | 1.3% | |
| 1982 | 744 | 1.3% | |
| 1994 | 738 | 1.2% | |
| 1972 | 708 | 1.2% | |
| Other values (30) | 8430 | 14.2% |
| Value | Count | Frequency (%) | |
| 0 | 20709 | 34.9% | |
| 1960 | 102 | 0.2% | |
| 1961 | 21 | < 0.1% | |
| 1962 | 30 | 0.1% | |
| 1963 | 85 | 0.1% | |
| 1964 | 40 | 0.1% | |
| 1965 | 19 | < 0.1% | |
| 1966 | 17 | < 0.1% | |
| 1967 | 88 | 0.1% | |
| 1968 | 77 | 0.1% |
| Value | Count | Frequency (%) | |
| 2013 | 176 | 0.3% | |
| 2012 | 1084 | 1.8% | |
| 2011 | 1256 | 2.1% | |
| 2010 | 2645 | 4.5% | |
| 2009 | 2533 | 4.3% | |
| 2008 | 2613 | 4.4% | |
| 2007 | 1587 | 2.7% | |
| 2006 | 1471 | 2.5% | |
| 2005 | 1011 | 1.7% | |
| 2004 | 1123 | 1.9% |
| Distinct count | 18 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.4 MiB |
| gravity | |
|---|---|
| nira/tanira | |
| other | |
| submersible | 4764 |
| swn 80 | 3670 |
| Other values (13) |
| Value | Count | Frequency (%) | |
| gravity | 26780 | 45.1% | |
| nira/tanira | 8154 | 13.7% | |
| other | 6430 | 10.8% | |
| submersible | 4764 | 8.0% | |
| swn 80 | 3670 | 6.2% | |
| mono | 2865 | 4.8% | |
| india mark ii | 2400 | 4.0% | |
| afridev | 1770 | 3.0% | |
| ksb | 1415 | 2.4% | |
| other - rope pump | 451 | 0.8% | |
| other - swn 81 | 229 | 0.4% | |
| windmill | 117 | 0.2% | |
| india mark iii | 98 | 0.2% | |
| cemo | 90 | 0.2% | |
| other - play pump | 85 | 0.1% | |
| walimi | 48 | 0.1% | |
| climax | 32 | 0.1% | |
| other - mkulima/shinyanga | 2 | < 0.1% |
Length
| Max length | 25 |
|---|---|
| Median length | 7 |
| Mean length | 7.719511785 |
| Min length | 3 |
Most occurring characters
| Value | Count | Frequency (%) | |
| i | 60078 | 13.1% | |
| r | 59768 | 13.0% | |
| a | 58179 | 12.7% | |
| t | 42131 | 9.2% | |
| v | 28550 | 6.2% | |
| y | 26867 | 5.9% | |
| g | 26782 | 5.8% | |
| n | 25691 | 5.6% | |
| e | 19036 | 4.2% | |
| s | 14844 | 3.2% | |
| o | 13468 | 2.9% | |
| 10965 | 2.4% | ||
| m | 10954 | 2.4% | |
| b | 10943 | 2.4% | |
| / | 8156 | 1.8% | |
| h | 7199 | 1.6% | |
| u | 5302 | 1.2% | |
| l | 5165 | 1.1% | |
| d | 4385 | 1.0% | |
| w | 4064 | 0.9% | |
| k | 3915 | 0.9% | |
| 8 | 3899 | 0.9% | |
| 0 | 3670 | 0.8% | |
| f | 1770 | 0.4% | |
| p | 1608 | 0.4% | |
| Other values (4) | 1150 | 0.3% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 430853 | 94.0% | |
| Space Separator | 10965 | 2.4% | |
| Other Punctuation | 8156 | 1.8% | |
| Decimal Number | 7798 | 1.7% | |
| Dash Punctuation | 767 | 0.2% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| i | 60078 | 13.9% | |
| r | 59768 | 13.9% | |
| a | 58179 | 13.5% | |
| t | 42131 | 9.8% | |
| v | 28550 | 6.6% | |
| y | 26867 | 6.2% | |
| g | 26782 | 6.2% | |
| n | 25691 | 6.0% | |
| e | 19036 | 4.4% | |
| s | 14844 | 3.4% | |
| o | 13468 | 3.1% | |
| m | 10954 | 2.5% | |
| b | 10943 | 2.5% | |
| h | 7199 | 1.7% | |
| u | 5302 | 1.2% | |
| l | 5165 | 1.2% | |
| d | 4385 | 1.0% | |
| w | 4064 | 0.9% | |
| k | 3915 | 0.9% | |
| f | 1770 | 0.4% | |
| p | 1608 | 0.4% | |
| c | 122 | < 0.1% | |
| x | 32 | < 0.1% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 10965 | 100.0% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 8 | 3899 | 50.0% | |
| 0 | 3670 | 47.1% | |
| 1 | 229 | 2.9% |
Most frequent Other Punctuation characters
| Value | Count | Frequency (%) | |
| / | 8156 | 100.0% |
Most frequent Dash Punctuation characters
| Value | Count | Frequency (%) | |
| - | 767 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 430853 | 94.0% | |
| Common | 27686 | 6.0% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| i | 60078 | 13.9% | |
| r | 59768 | 13.9% | |
| a | 58179 | 13.5% | |
| t | 42131 | 9.8% | |
| v | 28550 | 6.6% | |
| y | 26867 | 6.2% | |
| g | 26782 | 6.2% | |
| n | 25691 | 6.0% | |
| e | 19036 | 4.4% | |
| s | 14844 | 3.4% | |
| o | 13468 | 3.1% | |
| m | 10954 | 2.5% | |
| b | 10943 | 2.5% | |
| h | 7199 | 1.7% | |
| u | 5302 | 1.2% | |
| l | 5165 | 1.2% | |
| d | 4385 | 1.0% | |
| w | 4064 | 0.9% | |
| k | 3915 | 0.9% | |
| f | 1770 | 0.4% | |
| p | 1608 | 0.4% | |
| c | 122 | < 0.1% | |
| x | 32 | < 0.1% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 10965 | 39.6% | ||
| / | 8156 | 29.5% | |
| 8 | 3899 | 14.1% | |
| 0 | 3670 | 13.3% | |
| - | 767 | 2.8% | |
| 1 | 229 | 0.8% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 458539 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| i | 60078 | 13.1% | |
| r | 59768 | 13.0% | |
| a | 58179 | 12.7% | |
| t | 42131 | 9.2% | |
| v | 28550 | 6.2% | |
| y | 26867 | 5.9% | |
| g | 26782 | 5.8% | |
| n | 25691 | 5.6% | |
| e | 19036 | 4.2% | |
| s | 14844 | 3.2% | |
| o | 13468 | 2.9% | |
| 10965 | 2.4% | ||
| m | 10954 | 2.4% | |
| b | 10943 | 2.4% | |
| / | 8156 | 1.8% | |
| h | 7199 | 1.6% | |
| u | 5302 | 1.2% | |
| l | 5165 | 1.1% | |
| d | 4385 | 1.0% | |
| w | 4064 | 0.9% | |
| k | 3915 | 0.9% | |
| 8 | 3899 | 0.9% | |
| 0 | 3670 | 0.8% | |
| f | 1770 | 0.4% | |
| p | 1608 | 0.4% | |
| Other values (4) | 1150 | 0.3% |
| Distinct count | 13 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.4 MiB |
| gravity | |
|---|---|
| nira/tanira | |
| other | |
| submersible | |
| swn 80 | 3670 |
| Other values (8) |
| Value | Count | Frequency (%) | |
| gravity | 26780 | 45.1% | |
| nira/tanira | 8154 | 13.7% | |
| other | 6430 | 10.8% | |
| submersible | 6179 | 10.4% | |
| swn 80 | 3670 | 6.2% | |
| mono | 2865 | 4.8% | |
| india mark ii | 2400 | 4.0% | |
| afridev | 1770 | 3.0% | |
| rope pump | 451 | 0.8% | |
| other handpump | 364 | 0.6% | |
| other motorpump | 122 | 0.2% | |
| wind-powered | 117 | 0.2% | |
| india mark iii | 98 | 0.2% |
Length
| Max length | 15 |
|---|---|
| Median length | 7 |
| Mean length | 7.880538721 |
| Min length | 4 |
Most occurring characters
| Value | Count | Frequency (%) | |
| i | 61244 | 13.1% | |
| r | 61141 | 13.1% | |
| a | 58372 | 12.5% | |
| t | 41972 | 9.0% | |
| v | 28550 | 6.1% | |
| g | 26780 | 5.7% | |
| y | 26780 | 5.7% | |
| n | 25822 | 5.5% | |
| e | 21729 | 4.6% | |
| s | 16028 | 3.4% | |
| o | 13458 | 2.9% | |
| m | 12601 | 2.7% | |
| b | 12358 | 2.6% | |
| 9603 | 2.1% | ||
| / | 8154 | 1.7% | |
| h | 7280 | 1.6% | |
| u | 7116 | 1.5% | |
| l | 6179 | 1.3% | |
| d | 4866 | 1.0% | |
| w | 3904 | 0.8% | |
| 8 | 3670 | 0.8% | |
| 0 | 3670 | 0.8% | |
| k | 2498 | 0.5% | |
| p | 2442 | 0.5% | |
| f | 1770 | 0.4% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 442890 | 94.6% | |
| Space Separator | 9603 | 2.1% | |
| Other Punctuation | 8154 | 1.7% | |
| Decimal Number | 7340 | 1.6% | |
| Dash Punctuation | 117 | < 0.1% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| i | 61244 | 13.8% | |
| r | 61141 | 13.8% | |
| a | 58372 | 13.2% | |
| t | 41972 | 9.5% | |
| v | 28550 | 6.4% | |
| g | 26780 | 6.0% | |
| y | 26780 | 6.0% | |
| n | 25822 | 5.8% | |
| e | 21729 | 4.9% | |
| s | 16028 | 3.6% | |
| o | 13458 | 3.0% | |
| m | 12601 | 2.8% | |
| b | 12358 | 2.8% | |
| h | 7280 | 1.6% | |
| u | 7116 | 1.6% | |
| l | 6179 | 1.4% | |
| d | 4866 | 1.1% | |
| w | 3904 | 0.9% | |
| k | 2498 | 0.6% | |
| p | 2442 | 0.6% | |
| f | 1770 | 0.4% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 9603 | 100.0% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 8 | 3670 | 50.0% | |
| 0 | 3670 | 50.0% |
Most frequent Other Punctuation characters
| Value | Count | Frequency (%) | |
| / | 8154 | 100.0% |
Most frequent Dash Punctuation characters
| Value | Count | Frequency (%) | |
| - | 117 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 442890 | 94.6% | |
| Common | 25214 | 5.4% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| i | 61244 | 13.8% | |
| r | 61141 | 13.8% | |
| a | 58372 | 13.2% | |
| t | 41972 | 9.5% | |
| v | 28550 | 6.4% | |
| g | 26780 | 6.0% | |
| y | 26780 | 6.0% | |
| n | 25822 | 5.8% | |
| e | 21729 | 4.9% | |
| s | 16028 | 3.6% | |
| o | 13458 | 3.0% | |
| m | 12601 | 2.8% | |
| b | 12358 | 2.8% | |
| h | 7280 | 1.6% | |
| u | 7116 | 1.6% | |
| l | 6179 | 1.4% | |
| d | 4866 | 1.1% | |
| w | 3904 | 0.9% | |
| k | 2498 | 0.6% | |
| p | 2442 | 0.6% | |
| f | 1770 | 0.4% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 9603 | 38.1% | ||
| / | 8154 | 32.3% | |
| 8 | 3670 | 14.6% | |
| 0 | 3670 | 14.6% | |
| - | 117 | 0.5% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 468104 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| i | 61244 | 13.1% | |
| r | 61141 | 13.1% | |
| a | 58372 | 12.5% | |
| t | 41972 | 9.0% | |
| v | 28550 | 6.1% | |
| g | 26780 | 5.7% | |
| y | 26780 | 5.7% | |
| n | 25822 | 5.5% | |
| e | 21729 | 4.6% | |
| s | 16028 | 3.4% | |
| o | 13458 | 2.9% | |
| m | 12601 | 2.7% | |
| b | 12358 | 2.6% | |
| 9603 | 2.1% | ||
| / | 8154 | 1.7% | |
| h | 7280 | 1.6% | |
| u | 7116 | 1.5% | |
| l | 6179 | 1.3% | |
| d | 4866 | 1.0% | |
| w | 3904 | 0.8% | |
| 8 | 3670 | 0.8% | |
| 0 | 3670 | 0.8% | |
| k | 2498 | 0.5% | |
| p | 2442 | 0.5% | |
| f | 1770 | 0.4% |
| Distinct count | 7 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.4 MiB |
| gravity | |
|---|---|
| handpump | |
| other | |
| submersible | |
| motorpump | 2987 |
| Other values (2) | 568 |
| Value | Count | Frequency (%) | |
| gravity | 26780 | 45.1% | |
| handpump | 16456 | 27.7% | |
| other | 6430 | 10.8% | |
| submersible | 6179 | 10.4% | |
| motorpump | 2987 | 5.0% | |
| rope pump | 451 | 0.8% | |
| wind-powered | 117 | 0.2% |
Length
| Max length | 12 |
|---|---|
| Median length | 7 |
| Mean length | 7.602239057 |
| Min length | 5 |
Most occurring characters
| Value | Count | Frequency (%) | |
| a | 43236 | 9.6% | |
| r | 42944 | 9.5% | |
| p | 40356 | 8.9% | |
| t | 36197 | 8.0% | |
| i | 33076 | 7.3% | |
| m | 29060 | 6.4% | |
| g | 26780 | 5.9% | |
| v | 26780 | 5.9% | |
| y | 26780 | 5.9% | |
| u | 26073 | 5.8% | |
| h | 22886 | 5.1% | |
| e | 19473 | 4.3% | |
| d | 16690 | 3.7% | |
| n | 16573 | 3.7% | |
| o | 12972 | 2.9% | |
| s | 12358 | 2.7% | |
| b | 12358 | 2.7% | |
| l | 6179 | 1.4% | |
| 451 | 0.1% | ||
| w | 234 | 0.1% | |
| - | 117 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 451005 | 99.9% | |
| Space Separator | 451 | 0.1% | |
| Dash Punctuation | 117 | < 0.1% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| a | 43236 | 9.6% | |
| r | 42944 | 9.5% | |
| p | 40356 | 8.9% | |
| t | 36197 | 8.0% | |
| i | 33076 | 7.3% | |
| m | 29060 | 6.4% | |
| g | 26780 | 5.9% | |
| v | 26780 | 5.9% | |
| y | 26780 | 5.9% | |
| u | 26073 | 5.8% | |
| h | 22886 | 5.1% | |
| e | 19473 | 4.3% | |
| d | 16690 | 3.7% | |
| n | 16573 | 3.7% | |
| o | 12972 | 2.9% | |
| s | 12358 | 2.7% | |
| b | 12358 | 2.7% | |
| l | 6179 | 1.4% | |
| w | 234 | 0.1% |
Most frequent Dash Punctuation characters
| Value | Count | Frequency (%) | |
| - | 117 | 100.0% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 451 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 451005 | 99.9% | |
| Common | 568 | 0.1% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| a | 43236 | 9.6% | |
| r | 42944 | 9.5% | |
| p | 40356 | 8.9% | |
| t | 36197 | 8.0% | |
| i | 33076 | 7.3% | |
| m | 29060 | 6.4% | |
| g | 26780 | 5.9% | |
| v | 26780 | 5.9% | |
| y | 26780 | 5.9% | |
| u | 26073 | 5.8% | |
| h | 22886 | 5.1% | |
| e | 19473 | 4.3% | |
| d | 16690 | 3.7% | |
| n | 16573 | 3.7% | |
| o | 12972 | 2.9% | |
| s | 12358 | 2.7% | |
| b | 12358 | 2.7% | |
| l | 6179 | 1.4% | |
| w | 234 | 0.1% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 451 | 79.4% | ||
| - | 117 | 20.6% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 451573 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| a | 43236 | 9.6% | |
| r | 42944 | 9.5% | |
| p | 40356 | 8.9% | |
| t | 36197 | 8.0% | |
| i | 33076 | 7.3% | |
| m | 29060 | 6.4% | |
| g | 26780 | 5.9% | |
| v | 26780 | 5.9% | |
| y | 26780 | 5.9% | |
| u | 26073 | 5.8% | |
| h | 22886 | 5.1% | |
| e | 19473 | 4.3% | |
| d | 16690 | 3.7% | |
| n | 16573 | 3.7% | |
| o | 12972 | 2.9% | |
| s | 12358 | 2.7% | |
| b | 12358 | 2.7% | |
| l | 6179 | 1.4% | |
| 451 | 0.1% | ||
| w | 234 | 0.1% | |
| - | 117 | < 0.1% |
| Distinct count | 12 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.4 MiB |
| vwc | |
|---|---|
| wug | 6515 |
| water board | 2933 |
| wua | 2535 |
| private operator | 1971 |
| Other values (7) | 4939 |
| Value | Count | Frequency (%) | |
| vwc | 40507 | 68.2% | |
| wug | 6515 | 11.0% | |
| water board | 2933 | 4.9% | |
| wua | 2535 | 4.3% | |
| private operator | 1971 | 3.3% | |
| parastatal | 1768 | 3.0% | |
| water authority | 904 | 1.5% | |
| other | 844 | 1.4% | |
| company | 685 | 1.2% | |
| unknown | 561 | 0.9% | |
| other - school | 99 | 0.2% | |
| trust | 78 | 0.1% |
Length
| Max length | 16 |
|---|---|
| Median length | 3 |
| Mean length | 4.350639731 |
| Min length | 3 |
Most occurring characters
| Value | Count | Frequency (%) | |
| w | 53955 | 20.9% | |
| v | 42478 | 16.4% | |
| c | 41291 | 16.0% | |
| a | 21908 | 8.5% | |
| r | 16376 | 6.3% | |
| t | 14222 | 5.5% | |
| u | 10593 | 4.1% | |
| o | 10166 | 3.9% | |
| e | 8722 | 3.4% | |
| g | 6515 | 2.5% | |
| p | 6395 | 2.5% | |
| 6006 | 2.3% | ||
| b | 2933 | 1.1% | |
| d | 2933 | 1.1% | |
| i | 2875 | 1.1% | |
| n | 2368 | 0.9% | |
| h | 1946 | 0.8% | |
| s | 1945 | 0.8% | |
| l | 1867 | 0.7% | |
| y | 1589 | 0.6% | |
| m | 685 | 0.3% | |
| k | 561 | 0.2% | |
| - | 99 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 252323 | 97.6% | |
| Space Separator | 6006 | 2.3% | |
| Dash Punctuation | 99 | < 0.1% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| w | 53955 | 21.4% | |
| v | 42478 | 16.8% | |
| c | 41291 | 16.4% | |
| a | 21908 | 8.7% | |
| r | 16376 | 6.5% | |
| t | 14222 | 5.6% | |
| u | 10593 | 4.2% | |
| o | 10166 | 4.0% | |
| e | 8722 | 3.5% | |
| g | 6515 | 2.6% | |
| p | 6395 | 2.5% | |
| b | 2933 | 1.2% | |
| d | 2933 | 1.2% | |
| i | 2875 | 1.1% | |
| n | 2368 | 0.9% | |
| h | 1946 | 0.8% | |
| s | 1945 | 0.8% | |
| l | 1867 | 0.7% | |
| y | 1589 | 0.6% | |
| m | 685 | 0.3% | |
| k | 561 | 0.2% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 6006 | 100.0% |
Most frequent Dash Punctuation characters
| Value | Count | Frequency (%) | |
| - | 99 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 252323 | 97.6% | |
| Common | 6105 | 2.4% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| w | 53955 | 21.4% | |
| v | 42478 | 16.8% | |
| c | 41291 | 16.4% | |
| a | 21908 | 8.7% | |
| r | 16376 | 6.5% | |
| t | 14222 | 5.6% | |
| u | 10593 | 4.2% | |
| o | 10166 | 4.0% | |
| e | 8722 | 3.5% | |
| g | 6515 | 2.6% | |
| p | 6395 | 2.5% | |
| b | 2933 | 1.2% | |
| d | 2933 | 1.2% | |
| i | 2875 | 1.1% | |
| n | 2368 | 0.9% | |
| h | 1946 | 0.8% | |
| s | 1945 | 0.8% | |
| l | 1867 | 0.7% | |
| y | 1589 | 0.6% | |
| m | 685 | 0.3% | |
| k | 561 | 0.2% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 6006 | 98.4% | ||
| - | 99 | 1.6% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 258428 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| w | 53955 | 20.9% | |
| v | 42478 | 16.4% | |
| c | 41291 | 16.0% | |
| a | 21908 | 8.5% | |
| r | 16376 | 6.3% | |
| t | 14222 | 5.5% | |
| u | 10593 | 4.1% | |
| o | 10166 | 3.9% | |
| e | 8722 | 3.4% | |
| g | 6515 | 2.5% | |
| p | 6395 | 2.5% | |
| 6006 | 2.3% | ||
| b | 2933 | 1.1% | |
| d | 2933 | 1.1% | |
| i | 2875 | 1.1% | |
| n | 2368 | 0.9% | |
| h | 1946 | 0.8% | |
| s | 1945 | 0.8% | |
| l | 1867 | 0.7% | |
| y | 1589 | 0.6% | |
| m | 685 | 0.3% | |
| k | 561 | 0.2% | |
| - | 99 | < 0.1% |
| Distinct count | 5 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.4 MiB |
| user-group | |
|---|---|
| commercial | 3638 |
| parastatal | 1768 |
| other | 943 |
| unknown | 561 |
| Value | Count | Frequency (%) | |
| user-group | 52490 | 88.4% | |
| commercial | 3638 | 6.1% | |
| parastatal | 1768 | 3.0% | |
| other | 943 | 1.6% | |
| unknown | 561 | 0.9% |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 9.892289562 |
| Min length | 5 |
Most occurring characters
| Value | Count | Frequency (%) | |
| r | 111329 | 18.9% | |
| u | 105541 | 18.0% | |
| o | 57632 | 9.8% | |
| e | 57071 | 9.7% | |
| s | 54258 | 9.2% | |
| p | 54258 | 9.2% | |
| - | 52490 | 8.9% | |
| g | 52490 | 8.9% | |
| a | 10710 | 1.8% | |
| c | 7276 | 1.2% | |
| m | 7276 | 1.2% | |
| l | 5406 | 0.9% | |
| t | 4479 | 0.8% | |
| i | 3638 | 0.6% | |
| n | 1683 | 0.3% | |
| h | 943 | 0.2% | |
| k | 561 | 0.1% | |
| w | 561 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 535112 | 91.1% | |
| Dash Punctuation | 52490 | 8.9% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| r | 111329 | 20.8% | |
| u | 105541 | 19.7% | |
| o | 57632 | 10.8% | |
| e | 57071 | 10.7% | |
| s | 54258 | 10.1% | |
| p | 54258 | 10.1% | |
| g | 52490 | 9.8% | |
| a | 10710 | 2.0% | |
| c | 7276 | 1.4% | |
| m | 7276 | 1.4% | |
| l | 5406 | 1.0% | |
| t | 4479 | 0.8% | |
| i | 3638 | 0.7% | |
| n | 1683 | 0.3% | |
| h | 943 | 0.2% | |
| k | 561 | 0.1% | |
| w | 561 | 0.1% |
Most frequent Dash Punctuation characters
| Value | Count | Frequency (%) | |
| - | 52490 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 535112 | 91.1% | |
| Common | 52490 | 8.9% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| r | 111329 | 20.8% | |
| u | 105541 | 19.7% | |
| o | 57632 | 10.8% | |
| e | 57071 | 10.7% | |
| s | 54258 | 10.1% | |
| p | 54258 | 10.1% | |
| g | 52490 | 9.8% | |
| a | 10710 | 2.0% | |
| c | 7276 | 1.4% | |
| m | 7276 | 1.4% | |
| l | 5406 | 1.0% | |
| t | 4479 | 0.8% | |
| i | 3638 | 0.7% | |
| n | 1683 | 0.3% | |
| h | 943 | 0.2% | |
| k | 561 | 0.1% | |
| w | 561 | 0.1% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| - | 52490 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 587602 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| r | 111329 | 18.9% | |
| u | 105541 | 18.0% | |
| o | 57632 | 9.8% | |
| e | 57071 | 9.7% | |
| s | 54258 | 9.2% | |
| p | 54258 | 9.2% | |
| - | 52490 | 8.9% | |
| g | 52490 | 8.9% | |
| a | 10710 | 1.8% | |
| c | 7276 | 1.2% | |
| m | 7276 | 1.2% | |
| l | 5406 | 0.9% | |
| t | 4479 | 0.8% | |
| i | 3638 | 0.6% | |
| n | 1683 | 0.3% | |
| h | 943 | 0.2% | |
| k | 561 | 0.1% | |
| w | 561 | 0.1% |
| Distinct count | 7 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.4 MiB |
| never pay | |
|---|---|
| pay per bucket | |
| pay monthly | |
| unknown | |
| pay when scheme fails | 3914 |
| Other values (2) | 4696 |
| Value | Count | Frequency (%) | |
| never pay | 25348 | 42.7% | |
| pay per bucket | 8985 | 15.1% | |
| pay monthly | 8300 | 14.0% | |
| unknown | 8157 | 13.7% | |
| pay when scheme fails | 3914 | 6.6% | |
| pay annually | 3642 | 6.1% | |
| other | 1054 | 1.8% |
Length
| Max length | 21 |
|---|---|
| Median length | 9 |
| Mean length | 10.66479798 |
| Min length | 5 |
Most occurring characters
| Value | Count | Frequency (%) | |
| e | 81462 | 12.9% | |
| n | 69317 | 10.9% | |
| 67002 | 10.6% | ||
| y | 62131 | 9.8% | |
| a | 61387 | 9.7% | |
| p | 59174 | 9.3% | |
| r | 35387 | 5.6% | |
| v | 25348 | 4.0% | |
| u | 20784 | 3.3% | |
| l | 19498 | 3.1% | |
| t | 18339 | 2.9% | |
| o | 17511 | 2.8% | |
| h | 17182 | 2.7% | |
| k | 17142 | 2.7% | |
| c | 12899 | 2.0% | |
| m | 12214 | 1.9% | |
| w | 12071 | 1.9% | |
| b | 8985 | 1.4% | |
| s | 7828 | 1.2% | |
| f | 3914 | 0.6% | |
| i | 3914 | 0.6% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 566487 | 89.4% | |
| Space Separator | 67002 | 10.6% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| e | 81462 | 14.4% | |
| n | 69317 | 12.2% | |
| y | 62131 | 11.0% | |
| a | 61387 | 10.8% | |
| p | 59174 | 10.4% | |
| r | 35387 | 6.2% | |
| v | 25348 | 4.5% | |
| u | 20784 | 3.7% | |
| l | 19498 | 3.4% | |
| t | 18339 | 3.2% | |
| o | 17511 | 3.1% | |
| h | 17182 | 3.0% | |
| k | 17142 | 3.0% | |
| c | 12899 | 2.3% | |
| m | 12214 | 2.2% | |
| w | 12071 | 2.1% | |
| b | 8985 | 1.6% | |
| s | 7828 | 1.4% | |
| f | 3914 | 0.7% | |
| i | 3914 | 0.7% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 67002 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 566487 | 89.4% | |
| Common | 67002 | 10.6% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| e | 81462 | 14.4% | |
| n | 69317 | 12.2% | |
| y | 62131 | 11.0% | |
| a | 61387 | 10.8% | |
| p | 59174 | 10.4% | |
| r | 35387 | 6.2% | |
| v | 25348 | 4.5% | |
| u | 20784 | 3.7% | |
| l | 19498 | 3.4% | |
| t | 18339 | 3.2% | |
| o | 17511 | 3.1% | |
| h | 17182 | 3.0% | |
| k | 17142 | 3.0% | |
| c | 12899 | 2.3% | |
| m | 12214 | 2.2% | |
| w | 12071 | 2.1% | |
| b | 8985 | 1.6% | |
| s | 7828 | 1.4% | |
| f | 3914 | 0.7% | |
| i | 3914 | 0.7% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 67002 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 633489 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| e | 81462 | 12.9% | |
| n | 69317 | 10.9% | |
| 67002 | 10.6% | ||
| y | 62131 | 9.8% | |
| a | 61387 | 9.7% | |
| p | 59174 | 9.3% | |
| r | 35387 | 5.6% | |
| v | 25348 | 4.0% | |
| u | 20784 | 3.3% | |
| l | 19498 | 3.1% | |
| t | 18339 | 2.9% | |
| o | 17511 | 2.8% | |
| h | 17182 | 2.7% | |
| k | 17142 | 2.7% | |
| c | 12899 | 2.0% | |
| m | 12214 | 1.9% | |
| w | 12071 | 1.9% | |
| b | 8985 | 1.4% | |
| s | 7828 | 1.2% | |
| f | 3914 | 0.6% | |
| i | 3914 | 0.6% |
| Distinct count | 7 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.4 MiB |
| never pay | |
|---|---|
| per bucket | |
| monthly | |
| unknown | |
| on failure | 3914 |
| Other values (2) | 4696 |
| Value | Count | Frequency (%) | |
| never pay | 25348 | 42.7% | |
| per bucket | 8985 | 15.1% | |
| monthly | 8300 | 14.0% | |
| unknown | 8157 | 13.7% | |
| on failure | 3914 | 6.6% | |
| annually | 3642 | 6.1% | |
| other | 1054 | 1.8% |
Length
| Max length | 10 |
|---|---|
| Median length | 9 |
| Mean length | 8.530757576 |
| Min length | 5 |
Most occurring characters
| Value | Count | Frequency (%) | |
| e | 73634 | 14.5% | |
| n | 69317 | 13.7% | |
| r | 39301 | 7.8% | |
| 38247 | 7.5% | ||
| y | 37290 | 7.4% | |
| a | 36546 | 7.2% | |
| p | 34333 | 6.8% | |
| v | 25348 | 5.0% | |
| u | 24698 | 4.9% | |
| o | 21425 | 4.2% | |
| l | 19498 | 3.8% | |
| t | 18339 | 3.6% | |
| k | 17142 | 3.4% | |
| h | 9354 | 1.8% | |
| b | 8985 | 1.8% | |
| c | 8985 | 1.8% | |
| m | 8300 | 1.6% | |
| w | 8157 | 1.6% | |
| f | 3914 | 0.8% | |
| i | 3914 | 0.8% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 468480 | 92.5% | |
| Space Separator | 38247 | 7.5% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| e | 73634 | 15.7% | |
| n | 69317 | 14.8% | |
| r | 39301 | 8.4% | |
| y | 37290 | 8.0% | |
| a | 36546 | 7.8% | |
| p | 34333 | 7.3% | |
| v | 25348 | 5.4% | |
| u | 24698 | 5.3% | |
| o | 21425 | 4.6% | |
| l | 19498 | 4.2% | |
| t | 18339 | 3.9% | |
| k | 17142 | 3.7% | |
| h | 9354 | 2.0% | |
| b | 8985 | 1.9% | |
| c | 8985 | 1.9% | |
| m | 8300 | 1.8% | |
| w | 8157 | 1.7% | |
| f | 3914 | 0.8% | |
| i | 3914 | 0.8% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 38247 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 468480 | 92.5% | |
| Common | 38247 | 7.5% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| e | 73634 | 15.7% | |
| n | 69317 | 14.8% | |
| r | 39301 | 8.4% | |
| y | 37290 | 8.0% | |
| a | 36546 | 7.8% | |
| p | 34333 | 7.3% | |
| v | 25348 | 5.4% | |
| u | 24698 | 5.3% | |
| o | 21425 | 4.6% | |
| l | 19498 | 4.2% | |
| t | 18339 | 3.9% | |
| k | 17142 | 3.7% | |
| h | 9354 | 2.0% | |
| b | 8985 | 1.9% | |
| c | 8985 | 1.9% | |
| m | 8300 | 1.8% | |
| w | 8157 | 1.7% | |
| f | 3914 | 0.8% | |
| i | 3914 | 0.8% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 38247 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 506727 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| e | 73634 | 14.5% | |
| n | 69317 | 13.7% | |
| r | 39301 | 7.8% | |
| 38247 | 7.5% | ||
| y | 37290 | 7.4% | |
| a | 36546 | 7.2% | |
| p | 34333 | 6.8% | |
| v | 25348 | 5.0% | |
| u | 24698 | 4.9% | |
| o | 21425 | 4.2% | |
| l | 19498 | 3.8% | |
| t | 18339 | 3.6% | |
| k | 17142 | 3.4% | |
| h | 9354 | 1.8% | |
| b | 8985 | 1.8% | |
| c | 8985 | 1.8% | |
| m | 8300 | 1.6% | |
| w | 8157 | 1.6% | |
| f | 3914 | 0.8% | |
| i | 3914 | 0.8% |
| Distinct count | 8 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.4 MiB |
| soft | |
|---|---|
| salty | 4856 |
| unknown | 1876 |
| milky | 804 |
| coloured | 490 |
| Other values (3) | 556 |
| Value | Count | Frequency (%) | |
| soft | 50818 | 85.6% | |
| salty | 4856 | 8.2% | |
| unknown | 1876 | 3.2% | |
| milky | 804 | 1.4% | |
| coloured | 490 | 0.8% | |
| salty abandoned | 339 | 0.6% | |
| fluoride | 200 | 0.3% | |
| fluoride abandoned | 17 | < 0.1% |
Length
| Max length | 18 |
|---|---|
| Median length | 4 |
| Mean length | 4.303282828 |
| Min length | 4 |
Most occurring characters
| Value | Count | Frequency (%) | |
| s | 56013 | 21.9% | |
| t | 56013 | 21.9% | |
| o | 54247 | 21.2% | |
| f | 51035 | 20.0% | |
| l | 6706 | 2.6% | |
| n | 6340 | 2.5% | |
| y | 5999 | 2.3% | |
| a | 5907 | 2.3% | |
| k | 2680 | 1.0% | |
| u | 2583 | 1.0% | |
| w | 1876 | 0.7% | |
| d | 1419 | 0.6% | |
| e | 1063 | 0.4% | |
| i | 1021 | 0.4% | |
| m | 804 | 0.3% | |
| r | 707 | 0.3% | |
| c | 490 | 0.2% | |
| 356 | 0.1% | ||
| b | 356 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 255259 | 99.9% | |
| Space Separator | 356 | 0.1% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| s | 56013 | 21.9% | |
| t | 56013 | 21.9% | |
| o | 54247 | 21.3% | |
| f | 51035 | 20.0% | |
| l | 6706 | 2.6% | |
| n | 6340 | 2.5% | |
| y | 5999 | 2.4% | |
| a | 5907 | 2.3% | |
| k | 2680 | 1.0% | |
| u | 2583 | 1.0% | |
| w | 1876 | 0.7% | |
| d | 1419 | 0.6% | |
| e | 1063 | 0.4% | |
| i | 1021 | 0.4% | |
| m | 804 | 0.3% | |
| r | 707 | 0.3% | |
| c | 490 | 0.2% | |
| b | 356 | 0.1% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 356 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 255259 | 99.9% | |
| Common | 356 | 0.1% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| s | 56013 | 21.9% | |
| t | 56013 | 21.9% | |
| o | 54247 | 21.3% | |
| f | 51035 | 20.0% | |
| l | 6706 | 2.6% | |
| n | 6340 | 2.5% | |
| y | 5999 | 2.4% | |
| a | 5907 | 2.3% | |
| k | 2680 | 1.0% | |
| u | 2583 | 1.0% | |
| w | 1876 | 0.7% | |
| d | 1419 | 0.6% | |
| e | 1063 | 0.4% | |
| i | 1021 | 0.4% | |
| m | 804 | 0.3% | |
| r | 707 | 0.3% | |
| c | 490 | 0.2% | |
| b | 356 | 0.1% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 356 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 255615 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| s | 56013 | 21.9% | |
| t | 56013 | 21.9% | |
| o | 54247 | 21.2% | |
| f | 51035 | 20.0% | |
| l | 6706 | 2.6% | |
| n | 6340 | 2.5% | |
| y | 5999 | 2.3% | |
| a | 5907 | 2.3% | |
| k | 2680 | 1.0% | |
| u | 2583 | 1.0% | |
| w | 1876 | 0.7% | |
| d | 1419 | 0.6% | |
| e | 1063 | 0.4% | |
| i | 1021 | 0.4% | |
| m | 804 | 0.3% | |
| r | 707 | 0.3% | |
| c | 490 | 0.2% | |
| 356 | 0.1% | ||
| b | 356 | 0.1% |
| Distinct count | 6 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.4 MiB |
| good | |
|---|---|
| salty | 5195 |
| unknown | 1876 |
| milky | 804 |
| colored | 490 |
| Value | Count | Frequency (%) | |
| good | 50818 | 85.6% | |
| salty | 5195 | 8.7% | |
| unknown | 1876 | 3.2% | |
| milky | 804 | 1.4% | |
| colored | 490 | 0.8% | |
| fluoride | 217 | 0.4% |
Length
| Max length | 8 |
|---|---|
| Median length | 4 |
| Mean length | 4.23510101 |
| Min length | 4 |
Most occurring characters
| Value | Count | Frequency (%) | |
| o | 104709 | 41.6% | |
| d | 51525 | 20.5% | |
| g | 50818 | 20.2% | |
| l | 6706 | 2.7% | |
| y | 5999 | 2.4% | |
| n | 5628 | 2.2% | |
| s | 5195 | 2.1% | |
| a | 5195 | 2.1% | |
| t | 5195 | 2.1% | |
| k | 2680 | 1.1% | |
| u | 2093 | 0.8% | |
| w | 1876 | 0.7% | |
| i | 1021 | 0.4% | |
| m | 804 | 0.3% | |
| r | 707 | 0.3% | |
| e | 707 | 0.3% | |
| c | 490 | 0.2% | |
| f | 217 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 251565 | 100.0% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| o | 104709 | 41.6% | |
| d | 51525 | 20.5% | |
| g | 50818 | 20.2% | |
| l | 6706 | 2.7% | |
| y | 5999 | 2.4% | |
| n | 5628 | 2.2% | |
| s | 5195 | 2.1% | |
| a | 5195 | 2.1% | |
| t | 5195 | 2.1% | |
| k | 2680 | 1.1% | |
| u | 2093 | 0.8% | |
| w | 1876 | 0.7% | |
| i | 1021 | 0.4% | |
| m | 804 | 0.3% | |
| r | 707 | 0.3% | |
| e | 707 | 0.3% | |
| c | 490 | 0.2% | |
| f | 217 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 251565 | 100.0% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| o | 104709 | 41.6% | |
| d | 51525 | 20.5% | |
| g | 50818 | 20.2% | |
| l | 6706 | 2.7% | |
| y | 5999 | 2.4% | |
| n | 5628 | 2.2% | |
| s | 5195 | 2.1% | |
| a | 5195 | 2.1% | |
| t | 5195 | 2.1% | |
| k | 2680 | 1.1% | |
| u | 2093 | 0.8% | |
| w | 1876 | 0.7% | |
| i | 1021 | 0.4% | |
| m | 804 | 0.3% | |
| r | 707 | 0.3% | |
| e | 707 | 0.3% | |
| c | 490 | 0.2% | |
| f | 217 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 251565 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| o | 104709 | 41.6% | |
| d | 51525 | 20.5% | |
| g | 50818 | 20.2% | |
| l | 6706 | 2.7% | |
| y | 5999 | 2.4% | |
| n | 5628 | 2.2% | |
| s | 5195 | 2.1% | |
| a | 5195 | 2.1% | |
| t | 5195 | 2.1% | |
| k | 2680 | 1.1% | |
| u | 2093 | 0.8% | |
| w | 1876 | 0.7% | |
| i | 1021 | 0.4% | |
| m | 804 | 0.3% | |
| r | 707 | 0.3% | |
| e | 707 | 0.3% | |
| c | 490 | 0.2% | |
| f | 217 | 0.1% |
| Distinct count | 5 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.4 MiB |
| enough | |
|---|---|
| insufficient | |
| dry | 6246 |
| seasonal | 4050 |
| unknown | 789 |
| Value | Count | Frequency (%) | |
| enough | 33186 | 55.9% | |
| insufficient | 15129 | 25.5% | |
| dry | 6246 | 10.5% | |
| seasonal | 4050 | 6.8% | |
| unknown | 789 | 1.3% |
Length
| Max length | 12 |
|---|---|
| Median length | 6 |
| Mean length | 7.362373737 |
| Min length | 3 |
Most occurring characters
| Value | Count | Frequency (%) | |
| n | 69861 | 16.0% | |
| e | 52365 | 12.0% | |
| u | 49104 | 11.2% | |
| i | 45387 | 10.4% | |
| o | 38025 | 8.7% | |
| g | 33186 | 7.6% | |
| h | 33186 | 7.6% | |
| f | 30258 | 6.9% | |
| s | 23229 | 5.3% | |
| c | 15129 | 3.5% | |
| t | 15129 | 3.5% | |
| a | 8100 | 1.9% | |
| d | 6246 | 1.4% | |
| r | 6246 | 1.4% | |
| y | 6246 | 1.4% | |
| l | 4050 | 0.9% | |
| k | 789 | 0.2% | |
| w | 789 | 0.2% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 437325 | 100.0% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| n | 69861 | 16.0% | |
| e | 52365 | 12.0% | |
| u | 49104 | 11.2% | |
| i | 45387 | 10.4% | |
| o | 38025 | 8.7% | |
| g | 33186 | 7.6% | |
| h | 33186 | 7.6% | |
| f | 30258 | 6.9% | |
| s | 23229 | 5.3% | |
| c | 15129 | 3.5% | |
| t | 15129 | 3.5% | |
| a | 8100 | 1.9% | |
| d | 6246 | 1.4% | |
| r | 6246 | 1.4% | |
| y | 6246 | 1.4% | |
| l | 4050 | 0.9% | |
| k | 789 | 0.2% | |
| w | 789 | 0.2% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 437325 | 100.0% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| n | 69861 | 16.0% | |
| e | 52365 | 12.0% | |
| u | 49104 | 11.2% | |
| i | 45387 | 10.4% | |
| o | 38025 | 8.7% | |
| g | 33186 | 7.6% | |
| h | 33186 | 7.6% | |
| f | 30258 | 6.9% | |
| s | 23229 | 5.3% | |
| c | 15129 | 3.5% | |
| t | 15129 | 3.5% | |
| a | 8100 | 1.9% | |
| d | 6246 | 1.4% | |
| r | 6246 | 1.4% | |
| y | 6246 | 1.4% | |
| l | 4050 | 0.9% | |
| k | 789 | 0.2% | |
| w | 789 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 437325 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| n | 69861 | 16.0% | |
| e | 52365 | 12.0% | |
| u | 49104 | 11.2% | |
| i | 45387 | 10.4% | |
| o | 38025 | 8.7% | |
| g | 33186 | 7.6% | |
| h | 33186 | 7.6% | |
| f | 30258 | 6.9% | |
| s | 23229 | 5.3% | |
| c | 15129 | 3.5% | |
| t | 15129 | 3.5% | |
| a | 8100 | 1.9% | |
| d | 6246 | 1.4% | |
| r | 6246 | 1.4% | |
| y | 6246 | 1.4% | |
| l | 4050 | 0.9% | |
| k | 789 | 0.2% | |
| w | 789 | 0.2% |
| Distinct count | 5 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.4 MiB |
| enough | |
|---|---|
| insufficient | |
| dry | 6246 |
| seasonal | 4050 |
| unknown | 789 |
| Value | Count | Frequency (%) | |
| enough | 33186 | 55.9% | |
| insufficient | 15129 | 25.5% | |
| dry | 6246 | 10.5% | |
| seasonal | 4050 | 6.8% | |
| unknown | 789 | 1.3% |
Length
| Max length | 12 |
|---|---|
| Median length | 6 |
| Mean length | 7.362373737 |
| Min length | 3 |
Most occurring characters
| Value | Count | Frequency (%) | |
| n | 69861 | 16.0% | |
| e | 52365 | 12.0% | |
| u | 49104 | 11.2% | |
| i | 45387 | 10.4% | |
| o | 38025 | 8.7% | |
| g | 33186 | 7.6% | |
| h | 33186 | 7.6% | |
| f | 30258 | 6.9% | |
| s | 23229 | 5.3% | |
| c | 15129 | 3.5% | |
| t | 15129 | 3.5% | |
| a | 8100 | 1.9% | |
| d | 6246 | 1.4% | |
| r | 6246 | 1.4% | |
| y | 6246 | 1.4% | |
| l | 4050 | 0.9% | |
| k | 789 | 0.2% | |
| w | 789 | 0.2% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 437325 | 100.0% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| n | 69861 | 16.0% | |
| e | 52365 | 12.0% | |
| u | 49104 | 11.2% | |
| i | 45387 | 10.4% | |
| o | 38025 | 8.7% | |
| g | 33186 | 7.6% | |
| h | 33186 | 7.6% | |
| f | 30258 | 6.9% | |
| s | 23229 | 5.3% | |
| c | 15129 | 3.5% | |
| t | 15129 | 3.5% | |
| a | 8100 | 1.9% | |
| d | 6246 | 1.4% | |
| r | 6246 | 1.4% | |
| y | 6246 | 1.4% | |
| l | 4050 | 0.9% | |
| k | 789 | 0.2% | |
| w | 789 | 0.2% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 437325 | 100.0% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| n | 69861 | 16.0% | |
| e | 52365 | 12.0% | |
| u | 49104 | 11.2% | |
| i | 45387 | 10.4% | |
| o | 38025 | 8.7% | |
| g | 33186 | 7.6% | |
| h | 33186 | 7.6% | |
| f | 30258 | 6.9% | |
| s | 23229 | 5.3% | |
| c | 15129 | 3.5% | |
| t | 15129 | 3.5% | |
| a | 8100 | 1.9% | |
| d | 6246 | 1.4% | |
| r | 6246 | 1.4% | |
| y | 6246 | 1.4% | |
| l | 4050 | 0.9% | |
| k | 789 | 0.2% | |
| w | 789 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 437325 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| n | 69861 | 16.0% | |
| e | 52365 | 12.0% | |
| u | 49104 | 11.2% | |
| i | 45387 | 10.4% | |
| o | 38025 | 8.7% | |
| g | 33186 | 7.6% | |
| h | 33186 | 7.6% | |
| f | 30258 | 6.9% | |
| s | 23229 | 5.3% | |
| c | 15129 | 3.5% | |
| t | 15129 | 3.5% | |
| a | 8100 | 1.9% | |
| d | 6246 | 1.4% | |
| r | 6246 | 1.4% | |
| y | 6246 | 1.4% | |
| l | 4050 | 0.9% | |
| k | 789 | 0.2% | |
| w | 789 | 0.2% |
| Distinct count | 10 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.4 MiB |
| spring | |
|---|---|
| shallow well | |
| machine dbh | |
| river | |
| rainwater harvesting | 2295 |
| Other values (5) | 2573 |
| Value | Count | Frequency (%) | |
| spring | 17021 | 28.7% | |
| shallow well | 16824 | 28.3% | |
| machine dbh | 11075 | 18.6% | |
| river | 9612 | 16.2% | |
| rainwater harvesting | 2295 | 3.9% | |
| hand dtw | 874 | 1.5% | |
| lake | 765 | 1.3% | |
| dam | 656 | 1.1% | |
| other | 212 | 0.4% | |
| unknown | 66 | 0.1% |
Length
| Max length | 20 |
|---|---|
| Median length | 11 |
| Mean length | 8.978804714 |
| Min length | 3 |
Most occurring characters
| Value | Count | Frequency (%) | |
| l | 68061 | 12.8% | |
| r | 43342 | 8.1% | |
| e | 43078 | 8.1% | |
| h | 42355 | 7.9% | |
| i | 42298 | 7.9% | |
| a | 37079 | 7.0% | |
| w | 36883 | 6.9% | |
| s | 36140 | 6.8% | |
| n | 33758 | 6.3% | |
| 31068 | 5.8% | ||
| g | 19316 | 3.6% | |
| o | 17102 | 3.2% | |
| p | 17021 | 3.2% | |
| d | 13479 | 2.5% | |
| v | 11907 | 2.2% | |
| m | 11731 | 2.2% | |
| c | 11075 | 2.1% | |
| b | 11075 | 2.1% | |
| t | 5676 | 1.1% | |
| k | 831 | 0.2% | |
| u | 66 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 502273 | 94.2% | |
| Space Separator | 31068 | 5.8% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| l | 68061 | 13.6% | |
| r | 43342 | 8.6% | |
| e | 43078 | 8.6% | |
| h | 42355 | 8.4% | |
| i | 42298 | 8.4% | |
| a | 37079 | 7.4% | |
| w | 36883 | 7.3% | |
| s | 36140 | 7.2% | |
| n | 33758 | 6.7% | |
| g | 19316 | 3.8% | |
| o | 17102 | 3.4% | |
| p | 17021 | 3.4% | |
| d | 13479 | 2.7% | |
| v | 11907 | 2.4% | |
| m | 11731 | 2.3% | |
| c | 11075 | 2.2% | |
| b | 11075 | 2.2% | |
| t | 5676 | 1.1% | |
| k | 831 | 0.2% | |
| u | 66 | < 0.1% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 31068 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 502273 | 94.2% | |
| Common | 31068 | 5.8% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| l | 68061 | 13.6% | |
| r | 43342 | 8.6% | |
| e | 43078 | 8.6% | |
| h | 42355 | 8.4% | |
| i | 42298 | 8.4% | |
| a | 37079 | 7.4% | |
| w | 36883 | 7.3% | |
| s | 36140 | 7.2% | |
| n | 33758 | 6.7% | |
| g | 19316 | 3.8% | |
| o | 17102 | 3.4% | |
| p | 17021 | 3.4% | |
| d | 13479 | 2.7% | |
| v | 11907 | 2.4% | |
| m | 11731 | 2.3% | |
| c | 11075 | 2.2% | |
| b | 11075 | 2.2% | |
| t | 5676 | 1.1% | |
| k | 831 | 0.2% | |
| u | 66 | < 0.1% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 31068 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 533341 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| l | 68061 | 12.8% | |
| r | 43342 | 8.1% | |
| e | 43078 | 8.1% | |
| h | 42355 | 7.9% | |
| i | 42298 | 7.9% | |
| a | 37079 | 7.0% | |
| w | 36883 | 6.9% | |
| s | 36140 | 6.8% | |
| n | 33758 | 6.3% | |
| 31068 | 5.8% | ||
| g | 19316 | 3.6% | |
| o | 17102 | 3.2% | |
| p | 17021 | 3.2% | |
| d | 13479 | 2.5% | |
| v | 11907 | 2.2% | |
| m | 11731 | 2.2% | |
| c | 11075 | 2.1% | |
| b | 11075 | 2.1% | |
| t | 5676 | 1.1% | |
| k | 831 | 0.2% | |
| u | 66 | < 0.1% |
| Distinct count | 7 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.4 MiB |
| spring | |
|---|---|
| shallow well | |
| borehole | |
| river/lake | |
| rainwater harvesting | 2295 |
| Other values (2) | 934 |
| Value | Count | Frequency (%) | |
| spring | 17021 | 28.7% | |
| shallow well | 16824 | 28.3% | |
| borehole | 11949 | 20.1% | |
| river/lake | 10377 | 17.5% | |
| rainwater harvesting | 2295 | 3.9% | |
| dam | 656 | 1.1% | |
| other | 278 | 0.5% |
Length
| Max length | 20 |
|---|---|
| Median length | 8 |
| Mean length | 9.303602694 |
| Min length | 3 |
Most occurring characters
| Value | Count | Frequency (%) | |
| l | 89622 | 16.2% | |
| e | 66344 | 12.0% | |
| r | 56887 | 10.3% | |
| o | 41000 | 7.4% | |
| s | 36140 | 6.5% | |
| w | 35943 | 6.5% | |
| a | 34742 | 6.3% | |
| i | 31988 | 5.8% | |
| h | 31346 | 5.7% | |
| n | 21611 | 3.9% | |
| g | 19316 | 3.5% | |
| 19119 | 3.5% | ||
| p | 17021 | 3.1% | |
| v | 12672 | 2.3% | |
| b | 11949 | 2.2% | |
| / | 10377 | 1.9% | |
| k | 10377 | 1.9% | |
| t | 4868 | 0.9% | |
| d | 656 | 0.1% | |
| m | 656 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 523138 | 94.7% | |
| Space Separator | 19119 | 3.5% | |
| Other Punctuation | 10377 | 1.9% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| l | 89622 | 17.1% | |
| e | 66344 | 12.7% | |
| r | 56887 | 10.9% | |
| o | 41000 | 7.8% | |
| s | 36140 | 6.9% | |
| w | 35943 | 6.9% | |
| a | 34742 | 6.6% | |
| i | 31988 | 6.1% | |
| h | 31346 | 6.0% | |
| n | 21611 | 4.1% | |
| g | 19316 | 3.7% | |
| p | 17021 | 3.3% | |
| v | 12672 | 2.4% | |
| b | 11949 | 2.3% | |
| k | 10377 | 2.0% | |
| t | 4868 | 0.9% | |
| d | 656 | 0.1% | |
| m | 656 | 0.1% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 19119 | 100.0% |
Most frequent Other Punctuation characters
| Value | Count | Frequency (%) | |
| / | 10377 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 523138 | 94.7% | |
| Common | 29496 | 5.3% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| l | 89622 | 17.1% | |
| e | 66344 | 12.7% | |
| r | 56887 | 10.9% | |
| o | 41000 | 7.8% | |
| s | 36140 | 6.9% | |
| w | 35943 | 6.9% | |
| a | 34742 | 6.6% | |
| i | 31988 | 6.1% | |
| h | 31346 | 6.0% | |
| n | 21611 | 4.1% | |
| g | 19316 | 3.7% | |
| p | 17021 | 3.3% | |
| v | 12672 | 2.4% | |
| b | 11949 | 2.3% | |
| k | 10377 | 2.0% | |
| t | 4868 | 0.9% | |
| d | 656 | 0.1% | |
| m | 656 | 0.1% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 19119 | 64.8% | ||
| / | 10377 | 35.2% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 552634 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| l | 89622 | 16.2% | |
| e | 66344 | 12.0% | |
| r | 56887 | 10.3% | |
| o | 41000 | 7.4% | |
| s | 36140 | 6.5% | |
| w | 35943 | 6.5% | |
| a | 34742 | 6.3% | |
| i | 31988 | 5.8% | |
| h | 31346 | 5.7% | |
| n | 21611 | 3.9% | |
| g | 19316 | 3.5% | |
| 19119 | 3.5% | ||
| p | 17021 | 3.1% | |
| v | 12672 | 2.3% | |
| b | 11949 | 2.2% | |
| / | 10377 | 1.9% | |
| k | 10377 | 1.9% | |
| t | 4868 | 0.9% | |
| d | 656 | 0.1% | |
| m | 656 | 0.1% |
| Distinct count | 3 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.4 MiB |
| groundwater | |
|---|---|
| surface | |
| unknown | 278 |
| Value | Count | Frequency (%) | |
| groundwater | 45794 | 77.1% | |
| surface | 13328 | 22.4% | |
| unknown | 278 | 0.5% |
Length
| Max length | 11 |
|---|---|
| Median length | 11 |
| Mean length | 10.08377104 |
| Min length | 7 |
Most occurring characters
| Value | Count | Frequency (%) | |
| r | 104916 | 17.5% | |
| u | 59400 | 9.9% | |
| a | 59122 | 9.9% | |
| e | 59122 | 9.9% | |
| n | 46628 | 7.8% | |
| o | 46072 | 7.7% | |
| w | 46072 | 7.7% | |
| g | 45794 | 7.6% | |
| d | 45794 | 7.6% | |
| t | 45794 | 7.6% | |
| s | 13328 | 2.2% | |
| f | 13328 | 2.2% | |
| c | 13328 | 2.2% | |
| k | 278 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 598976 | 100.0% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| r | 104916 | 17.5% | |
| u | 59400 | 9.9% | |
| a | 59122 | 9.9% | |
| e | 59122 | 9.9% | |
| n | 46628 | 7.8% | |
| o | 46072 | 7.7% | |
| w | 46072 | 7.7% | |
| g | 45794 | 7.6% | |
| d | 45794 | 7.6% | |
| t | 45794 | 7.6% | |
| s | 13328 | 2.2% | |
| f | 13328 | 2.2% | |
| c | 13328 | 2.2% | |
| k | 278 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 598976 | 100.0% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| r | 104916 | 17.5% | |
| u | 59400 | 9.9% | |
| a | 59122 | 9.9% | |
| e | 59122 | 9.9% | |
| n | 46628 | 7.8% | |
| o | 46072 | 7.7% | |
| w | 46072 | 7.7% | |
| g | 45794 | 7.6% | |
| d | 45794 | 7.6% | |
| t | 45794 | 7.6% | |
| s | 13328 | 2.2% | |
| f | 13328 | 2.2% | |
| c | 13328 | 2.2% | |
| k | 278 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 598976 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| r | 104916 | 17.5% | |
| u | 59400 | 9.9% | |
| a | 59122 | 9.9% | |
| e | 59122 | 9.9% | |
| n | 46628 | 7.8% | |
| o | 46072 | 7.7% | |
| w | 46072 | 7.7% | |
| g | 45794 | 7.6% | |
| d | 45794 | 7.6% | |
| t | 45794 | 7.6% | |
| s | 13328 | 2.2% | |
| f | 13328 | 2.2% | |
| c | 13328 | 2.2% | |
| k | 278 | < 0.1% |
| Distinct count | 7 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.4 MiB |
| communal standpipe | |
|---|---|
| hand pump | |
| other | |
| communal standpipe multiple | |
| improved spring | 784 |
| Other values (2) | 123 |
| Value | Count | Frequency (%) | |
| communal standpipe | 28522 | 48.0% | |
| hand pump | 17488 | 29.4% | |
| other | 6380 | 10.7% | |
| communal standpipe multiple | 6103 | 10.3% | |
| improved spring | 784 | 1.3% | |
| cattle trough | 116 | 0.2% | |
| dam | 7 | < 0.1% |
Length
| Max length | 27 |
|---|---|
| Median length | 18 |
| Mean length | 14.82757576 |
| Min length | 3 |
Most occurring characters
| Value | Count | Frequency (%) | |
| p | 111897 | 12.7% | |
| m | 93632 | 10.6% | |
| n | 87522 | 9.9% | |
| a | 86861 | 9.9% | |
| 59116 | 6.7% | ||
| u | 58332 | 6.6% | |
| d | 52904 | 6.0% | |
| e | 48008 | 5.5% | |
| t | 47456 | 5.4% | |
| l | 46947 | 5.3% | |
| i | 42296 | 4.8% | |
| o | 41905 | 4.8% | |
| s | 35409 | 4.0% | |
| c | 34741 | 3.9% | |
| h | 23984 | 2.7% | |
| r | 8064 | 0.9% | |
| g | 900 | 0.1% | |
| v | 784 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 821642 | 93.3% | |
| Space Separator | 59116 | 6.7% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| p | 111897 | 13.6% | |
| m | 93632 | 11.4% | |
| n | 87522 | 10.7% | |
| a | 86861 | 10.6% | |
| u | 58332 | 7.1% | |
| d | 52904 | 6.4% | |
| e | 48008 | 5.8% | |
| t | 47456 | 5.8% | |
| l | 46947 | 5.7% | |
| i | 42296 | 5.1% | |
| o | 41905 | 5.1% | |
| s | 35409 | 4.3% | |
| c | 34741 | 4.2% | |
| h | 23984 | 2.9% | |
| r | 8064 | 1.0% | |
| g | 900 | 0.1% | |
| v | 784 | 0.1% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 59116 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 821642 | 93.3% | |
| Common | 59116 | 6.7% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| p | 111897 | 13.6% | |
| m | 93632 | 11.4% | |
| n | 87522 | 10.7% | |
| a | 86861 | 10.6% | |
| u | 58332 | 7.1% | |
| d | 52904 | 6.4% | |
| e | 48008 | 5.8% | |
| t | 47456 | 5.8% | |
| l | 46947 | 5.7% | |
| i | 42296 | 5.1% | |
| o | 41905 | 5.1% | |
| s | 35409 | 4.3% | |
| c | 34741 | 4.2% | |
| h | 23984 | 2.9% | |
| r | 8064 | 1.0% | |
| g | 900 | 0.1% | |
| v | 784 | 0.1% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 59116 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 880758 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| p | 111897 | 12.7% | |
| m | 93632 | 10.6% | |
| n | 87522 | 9.9% | |
| a | 86861 | 9.9% | |
| 59116 | 6.7% | ||
| u | 58332 | 6.6% | |
| d | 52904 | 6.0% | |
| e | 48008 | 5.5% | |
| t | 47456 | 5.4% | |
| l | 46947 | 5.3% | |
| i | 42296 | 4.8% | |
| o | 41905 | 4.8% | |
| s | 35409 | 4.0% | |
| c | 34741 | 3.9% | |
| h | 23984 | 2.7% | |
| r | 8064 | 0.9% | |
| g | 900 | 0.1% | |
| v | 784 | 0.1% |
| Distinct count | 6 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.4 MiB |
| communal standpipe | |
|---|---|
| hand pump | |
| other | 6380 |
| improved spring | 784 |
| cattle trough | 116 |
| Value | Count | Frequency (%) | |
| communal standpipe | 34625 | 58.3% | |
| hand pump | 17488 | 29.4% | |
| other | 6380 | 10.7% | |
| improved spring | 784 | 1.3% | |
| cattle trough | 116 | 0.2% | |
| dam | 7 | < 0.1% |
Length
| Max length | 18 |
|---|---|
| Median length | 18 |
| Mean length | 13.90287879 |
| Min length | 3 |
Most occurring characters
| Value | Count | Frequency (%) | |
| p | 105794 | 12.8% | |
| m | 87529 | 10.6% | |
| n | 87522 | 10.6% | |
| a | 86861 | 10.5% | |
| 53013 | 6.4% | ||
| d | 52904 | 6.4% | |
| u | 52229 | 6.3% | |
| o | 41905 | 5.1% | |
| e | 41905 | 5.1% | |
| t | 41353 | 5.0% | |
| i | 36193 | 4.4% | |
| s | 35409 | 4.3% | |
| c | 34741 | 4.2% | |
| l | 34741 | 4.2% | |
| h | 23984 | 2.9% | |
| r | 8064 | 1.0% | |
| g | 900 | 0.1% | |
| v | 784 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 772818 | 93.6% | |
| Space Separator | 53013 | 6.4% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| p | 105794 | 13.7% | |
| m | 87529 | 11.3% | |
| n | 87522 | 11.3% | |
| a | 86861 | 11.2% | |
| d | 52904 | 6.8% | |
| u | 52229 | 6.8% | |
| o | 41905 | 5.4% | |
| e | 41905 | 5.4% | |
| t | 41353 | 5.4% | |
| i | 36193 | 4.7% | |
| s | 35409 | 4.6% | |
| c | 34741 | 4.5% | |
| l | 34741 | 4.5% | |
| h | 23984 | 3.1% | |
| r | 8064 | 1.0% | |
| g | 900 | 0.1% | |
| v | 784 | 0.1% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 53013 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 772818 | 93.6% | |
| Common | 53013 | 6.4% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| p | 105794 | 13.7% | |
| m | 87529 | 11.3% | |
| n | 87522 | 11.3% | |
| a | 86861 | 11.2% | |
| d | 52904 | 6.8% | |
| u | 52229 | 6.8% | |
| o | 41905 | 5.4% | |
| e | 41905 | 5.4% | |
| t | 41353 | 5.4% | |
| i | 36193 | 4.7% | |
| s | 35409 | 4.6% | |
| c | 34741 | 4.5% | |
| l | 34741 | 4.5% | |
| h | 23984 | 3.1% | |
| r | 8064 | 1.0% | |
| g | 900 | 0.1% | |
| v | 784 | 0.1% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 53013 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 825831 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| p | 105794 | 12.8% | |
| m | 87529 | 10.6% | |
| n | 87522 | 10.6% | |
| a | 86861 | 10.5% | |
| 53013 | 6.4% | ||
| d | 52904 | 6.4% | |
| u | 52229 | 6.3% | |
| o | 41905 | 5.1% | |
| e | 41905 | 5.1% | |
| t | 41353 | 5.0% | |
| i | 36193 | 4.4% | |
| s | 35409 | 4.3% | |
| c | 34741 | 4.2% | |
| l | 34741 | 4.2% | |
| h | 23984 | 2.9% | |
| r | 8064 | 1.0% | |
| g | 900 | 0.1% | |
| v | 784 | 0.1% |
status_group
Categorical
| Distinct count | 3 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.4 MiB |
| functional | |
|---|---|
| non functional | |
| functional needs repair | 4317 |
| Value | Count | Frequency (%) | |
| functional | 32259 | 54.3% | |
| non functional | 22824 | 38.4% | |
| functional needs repair | 4317 | 7.3% |
Length
| Max length | 23 |
|---|---|
| Median length | 10 |
| Mean length | 12.48176768 |
| Min length | 10 |
Most occurring characters
| Value | Count | Frequency (%) | |
| n | 168765 | 22.8% | |
| o | 82224 | 11.1% | |
| i | 63717 | 8.6% | |
| a | 63717 | 8.6% | |
| f | 59400 | 8.0% | |
| u | 59400 | 8.0% | |
| c | 59400 | 8.0% | |
| t | 59400 | 8.0% | |
| l | 59400 | 8.0% | |
| 31458 | 4.2% | ||
| e | 12951 | 1.7% | |
| r | 8634 | 1.2% | |
| d | 4317 | 0.6% | |
| s | 4317 | 0.6% | |
| p | 4317 | 0.6% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 709959 | 95.8% | |
| Space Separator | 31458 | 4.2% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| n | 168765 | 23.8% | |
| o | 82224 | 11.6% | |
| i | 63717 | 9.0% | |
| a | 63717 | 9.0% | |
| f | 59400 | 8.4% | |
| u | 59400 | 8.4% | |
| c | 59400 | 8.4% | |
| t | 59400 | 8.4% | |
| l | 59400 | 8.4% | |
| e | 12951 | 1.8% | |
| r | 8634 | 1.2% | |
| d | 4317 | 0.6% | |
| s | 4317 | 0.6% | |
| p | 4317 | 0.6% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 31458 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 709959 | 95.8% | |
| Common | 31458 | 4.2% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| n | 168765 | 23.8% | |
| o | 82224 | 11.6% | |
| i | 63717 | 9.0% | |
| a | 63717 | 9.0% | |
| f | 59400 | 8.4% | |
| u | 59400 | 8.4% | |
| c | 59400 | 8.4% | |
| t | 59400 | 8.4% | |
| l | 59400 | 8.4% | |
| e | 12951 | 1.8% | |
| r | 8634 | 1.2% | |
| d | 4317 | 0.6% | |
| s | 4317 | 0.6% | |
| p | 4317 | 0.6% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 31458 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 741417 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| n | 168765 | 22.8% | |
| o | 82224 | 11.1% | |
| i | 63717 | 8.6% | |
| a | 63717 | 8.6% | |
| f | 59400 | 8.0% | |
| u | 59400 | 8.0% | |
| c | 59400 | 8.0% | |
| t | 59400 | 8.0% | |
| l | 59400 | 8.0% | |
| 31458 | 4.2% | ||
| e | 12951 | 1.7% | |
| r | 8634 | 1.2% | |
| d | 4317 | 0.6% | |
| s | 4317 | 0.6% | |
| p | 4317 | 0.6% |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.First rows
| id | amount_tsh | date_recorded | funder | gps_height | installer | longitude | latitude | wpt_name | num_private | basin | subvillage | region | region_code | district_code | lga | ward | population | public_meeting | recorded_by | scheme_management | scheme_name | permit | construction_year | extraction_type | extraction_type_group | extraction_type_class | management | management_group | payment | payment_type | water_quality | quality_group | quantity | quantity_group | source | source_type | source_class | waterpoint_type | waterpoint_type_group | status_group | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 69572 | 6000.0 | 2011-03-14 | Roman | 1390 | Roman | 34.938093 | -9.856322 | none | 0 | Lake Nyasa | Mnyusi B | Iringa | 11 | 5 | Ludewa | Mundindi | 109 | True | GeoData Consultants Ltd | VWC | Roman | False | 1999 | gravity | gravity | gravity | vwc | user-group | pay annually | annually | soft | good | enough | enough | spring | spring | groundwater | communal standpipe | communal standpipe | functional |
| 1 | 8776 | 0.0 | 2013-03-06 | Grumeti | 1399 | GRUMETI | 34.698766 | -2.147466 | Zahanati | 0 | Lake Victoria | Nyamara | Mara | 20 | 2 | Serengeti | Natta | 280 | NaN | GeoData Consultants Ltd | Other | NaN | True | 2010 | gravity | gravity | gravity | wug | user-group | never pay | never pay | soft | good | insufficient | insufficient | rainwater harvesting | rainwater harvesting | surface | communal standpipe | communal standpipe | functional |
| 2 | 34310 | 25.0 | 2013-02-25 | Lottery Club | 686 | World vision | 37.460664 | -3.821329 | Kwa Mahundi | 0 | Pangani | Majengo | Manyara | 21 | 4 | Simanjiro | Ngorika | 250 | True | GeoData Consultants Ltd | VWC | Nyumba ya mungu pipe scheme | True | 2009 | gravity | gravity | gravity | vwc | user-group | pay per bucket | per bucket | soft | good | enough | enough | dam | dam | surface | communal standpipe multiple | communal standpipe | functional |
| 3 | 67743 | 0.0 | 2013-01-28 | Unicef | 263 | UNICEF | 38.486161 | -11.155298 | Zahanati Ya Nanyumbu | 0 | Ruvuma / Southern Coast | Mahakamani | Mtwara | 90 | 63 | Nanyumbu | Nanyumbu | 58 | True | GeoData Consultants Ltd | VWC | NaN | True | 1986 | submersible | submersible | submersible | vwc | user-group | never pay | never pay | soft | good | dry | dry | machine dbh | borehole | groundwater | communal standpipe multiple | communal standpipe | non functional |
| 4 | 19728 | 0.0 | 2011-07-13 | Action In A | 0 | Artisan | 31.130847 | -1.825359 | Shuleni | 0 | Lake Victoria | Kyanyamisa | Kagera | 18 | 1 | Karagwe | Nyakasimbi | 0 | True | GeoData Consultants Ltd | NaN | NaN | True | 0 | gravity | gravity | gravity | other | other | never pay | never pay | soft | good | seasonal | seasonal | rainwater harvesting | rainwater harvesting | surface | communal standpipe | communal standpipe | functional |
| 5 | 9944 | 20.0 | 2011-03-13 | Mkinga Distric Coun | 0 | DWE | 39.172796 | -4.765587 | Tajiri | 0 | Pangani | Moa/Mwereme | Tanga | 4 | 8 | Mkinga | Moa | 1 | True | GeoData Consultants Ltd | VWC | Zingibali | True | 2009 | submersible | submersible | submersible | vwc | user-group | pay per bucket | per bucket | salty | salty | enough | enough | other | other | unknown | communal standpipe multiple | communal standpipe | functional |
| 6 | 19816 | 0.0 | 2012-10-01 | Dwsp | 0 | DWSP | 33.362410 | -3.766365 | Kwa Ngomho | 0 | Internal | Ishinabulandi | Shinyanga | 17 | 3 | Shinyanga Rural | Samuye | 0 | True | GeoData Consultants Ltd | VWC | NaN | True | 0 | swn 80 | swn 80 | handpump | vwc | user-group | never pay | never pay | soft | good | enough | enough | machine dbh | borehole | groundwater | hand pump | hand pump | non functional |
| 7 | 54551 | 0.0 | 2012-10-09 | Rwssp | 0 | DWE | 32.620617 | -4.226198 | Tushirikiane | 0 | Lake Tanganyika | Nyawishi Center | Shinyanga | 17 | 3 | Kahama | Chambo | 0 | True | GeoData Consultants Ltd | NaN | NaN | True | 0 | nira/tanira | nira/tanira | handpump | wug | user-group | unknown | unknown | milky | milky | enough | enough | shallow well | shallow well | groundwater | hand pump | hand pump | non functional |
| 8 | 53934 | 0.0 | 2012-11-03 | Wateraid | 0 | Water Aid | 32.711100 | -5.146712 | Kwa Ramadhan Musa | 0 | Lake Tanganyika | Imalauduki | Tabora | 14 | 6 | Tabora Urban | Itetemia | 0 | True | GeoData Consultants Ltd | VWC | NaN | True | 0 | india mark ii | india mark ii | handpump | vwc | user-group | never pay | never pay | salty | salty | seasonal | seasonal | machine dbh | borehole | groundwater | hand pump | hand pump | non functional |
| 9 | 46144 | 0.0 | 2011-08-03 | Isingiro Ho | 0 | Artisan | 30.626991 | -1.257051 | Kwapeto | 0 | Lake Victoria | Mkonomre | Kagera | 18 | 1 | Karagwe | Kaisho | 0 | True | GeoData Consultants Ltd | NaN | NaN | True | 0 | nira/tanira | nira/tanira | handpump | vwc | user-group | never pay | never pay | soft | good | enough | enough | shallow well | shallow well | groundwater | hand pump | hand pump | functional |
Last rows
| id | amount_tsh | date_recorded | funder | gps_height | installer | longitude | latitude | wpt_name | num_private | basin | subvillage | region | region_code | district_code | lga | ward | population | public_meeting | recorded_by | scheme_management | scheme_name | permit | construction_year | extraction_type | extraction_type_group | extraction_type_class | management | management_group | payment | payment_type | water_quality | quality_group | quantity | quantity_group | source | source_type | source_class | waterpoint_type | waterpoint_type_group | status_group | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 59390 | 13677 | 0.0 | 2011-08-04 | Rudep | 1715 | DWE | 31.370848 | -8.258160 | Kwa Mzee Atanas | 0 | Lake Tanganyika | Kitonto | Rukwa | 15 | 2 | Sumbawanga Rural | Mkowe | 150 | True | GeoData Consultants Ltd | VWC | NaN | False | 1991 | swn 80 | swn 80 | handpump | vwc | user-group | never pay | never pay | soft | good | insufficient | insufficient | machine dbh | borehole | groundwater | hand pump | hand pump | functional |
| 59391 | 44885 | 0.0 | 2013-08-03 | Government Of Tanzania | 540 | Government | 38.044070 | -4.272218 | Kwa | 0 | Pangani | Maore Kati | Kilimanjaro | 3 | 3 | Same | Maore | 210 | True | GeoData Consultants Ltd | Water authority | Hingilili | True | 1967 | gravity | gravity | gravity | vwc | user-group | never pay | never pay | soft | good | enough | enough | river | river/lake | surface | communal standpipe | communal standpipe | non functional |
| 59392 | 40607 | 0.0 | 2011-04-15 | Government Of Tanzania | 0 | Government | 33.009440 | -8.520888 | Benard Charles | 0 | Lake Rukwa | Mbuyuni A | Mbeya | 12 | 1 | Chunya | Mbuyuni | 0 | True | GeoData Consultants Ltd | VWC | NaN | True | 0 | gravity | gravity | gravity | vwc | user-group | never pay | never pay | soft | good | enough | enough | spring | spring | groundwater | communal standpipe | communal standpipe | non functional |
| 59393 | 48348 | 0.0 | 2012-10-27 | Private | 0 | Private | 33.866852 | -4.287410 | Kwa Peter | 0 | Internal | Masanga | Tabora | 14 | 2 | Igunga | Igunga | 0 | False | GeoData Consultants Ltd | Water authority | NaN | False | 0 | gravity | gravity | gravity | private operator | commercial | pay per bucket | per bucket | soft | good | insufficient | insufficient | dam | dam | surface | other | other | functional |
| 59394 | 11164 | 500.0 | 2011-03-09 | World Bank | 351 | ML appro | 37.634053 | -6.124830 | Chimeredya | 0 | Wami / Ruvu | Komstari | Morogoro | 5 | 6 | Mvomero | Diongoya | 89 | True | GeoData Consultants Ltd | VWC | NaN | True | 2007 | submersible | submersible | submersible | vwc | user-group | pay monthly | monthly | soft | good | enough | enough | machine dbh | borehole | groundwater | communal standpipe | communal standpipe | non functional |
| 59395 | 60739 | 10.0 | 2013-05-03 | Germany Republi | 1210 | CES | 37.169807 | -3.253847 | Area Three Namba 27 | 0 | Pangani | Kiduruni | Kilimanjaro | 3 | 5 | Hai | Masama Magharibi | 125 | True | GeoData Consultants Ltd | Water Board | Losaa Kia water supply | True | 1999 | gravity | gravity | gravity | water board | user-group | pay per bucket | per bucket | soft | good | enough | enough | spring | spring | groundwater | communal standpipe | communal standpipe | functional |
| 59396 | 27263 | 4700.0 | 2011-05-07 | Cefa-njombe | 1212 | Cefa | 35.249991 | -9.070629 | Kwa Yahona Kuvala | 0 | Rufiji | Igumbilo | Iringa | 11 | 4 | Njombe | Ikondo | 56 | True | GeoData Consultants Ltd | VWC | Ikondo electrical water sch | True | 1996 | gravity | gravity | gravity | vwc | user-group | pay annually | annually | soft | good | enough | enough | river | river/lake | surface | communal standpipe | communal standpipe | functional |
| 59397 | 37057 | 0.0 | 2011-04-11 | NaN | 0 | NaN | 34.017087 | -8.750434 | Mashine | 0 | Rufiji | Madungulu | Mbeya | 12 | 7 | Mbarali | Chimala | 0 | True | GeoData Consultants Ltd | VWC | NaN | False | 0 | swn 80 | swn 80 | handpump | vwc | user-group | pay monthly | monthly | fluoride | fluoride | enough | enough | machine dbh | borehole | groundwater | hand pump | hand pump | functional |
| 59398 | 31282 | 0.0 | 2011-03-08 | Malec | 0 | Musa | 35.861315 | -6.378573 | Mshoro | 0 | Rufiji | Mwinyi | Dodoma | 1 | 4 | Chamwino | Mvumi Makulu | 0 | True | GeoData Consultants Ltd | VWC | NaN | True | 0 | nira/tanira | nira/tanira | handpump | vwc | user-group | never pay | never pay | soft | good | insufficient | insufficient | shallow well | shallow well | groundwater | hand pump | hand pump | functional |
| 59399 | 26348 | 0.0 | 2011-03-23 | World Bank | 191 | World | 38.104048 | -6.747464 | Kwa Mzee Lugawa | 0 | Wami / Ruvu | Kikatanyemba | Morogoro | 5 | 2 | Morogoro Rural | Ngerengere | 150 | True | GeoData Consultants Ltd | VWC | NaN | True | 2002 | nira/tanira | nira/tanira | handpump | vwc | user-group | pay when scheme fails | on failure | salty | salty | enough | enough | shallow well | shallow well | groundwater | hand pump | hand pump | functional |